A modified sentenceNormalizer by gregomm
Features:
1- Removes duplicated question marks, exclamations and periods
2- Capitalize first letter of a sentence.
3- Split sentences not only with "." but also with "?" and "!"
4- Puts a white space at the end of each sentence
5- Retains newlines
--removed from orginal function--
undestand the meaning of "¡" and "¿" in languages like spanish.
undestand the htmlentitity version of this simbols.
--removed from orginal function--
<?php
function sentenceNormalizer($sentence_split) {
$sentence_split = preg_replace(array('/[!]+/','/[?]+/','/[.]+/'),
array('!','?','.'),$sentence_split);
$textbad = preg_split("/(\!|\.|\?|\n)/", $sentence_split,-1,PREG_SPLIT_DELIM_CAPTURE);
$newtext = array();
$count = sizeof($textbad);
foreach($textbad as $key => $string) {
if (!empty($string)) {
$text = trim($string, ' ');
$size = strlen($text);
if ($size > 1){
$newtext[] = ucfirst(strtolower($text));
}
elseif ($size == 1) {
$newtext[] = ($text == "\n") ? $text : $text . ' ';
}
}
}
return implode($newtext);
}
?>
ucwords
(PHP 4, PHP 5)
ucwords — Pone en mayúsculas el primer carácter de cada palabra de una cadena
Descripción
string ucwords
( string $cadena
)
Pasa a mayúsculas la primera letra de cada palabra en cadena si dicho carácter es alfabético.
La forma de determinar qué es una palabra es la siguiente: cualquier cadena de caracteres que esté seguida por un espacio en blanco (los espacios en blanco definidos son: espacio, nueva línea, retorno de carro, tabulador horizontal y tabulador vertical).
Example #1 Ejemplo de ucwords()
<?php
$cadena = 'en un lugar de la mancha';
$cadena = ucwords($cadena); // En Un Lugar De La Mancha
$cadena = 'EN UN LUGAR DE LA MANCHA';
$cadena = ucwords($cadena); // EN UN LUGAR DE LA MANCHA
$cadena = ucwords(strtolower($cadena)); // En Un Lugar De La Mancha
?>
Note: Esta función es segura binariamente.
Vea también strtoupper(), strtolower() y ucfirst().
ucwords
Alex
27-Aug-2008 06:43
27-Aug-2008 06:43
majd87 at gmail dot com
06-Aug-2008 03:58
06-Aug-2008 03:58
I modified Q1712's code (below) to use regular expressions
instead of characters to properly capitalize words that fall
directly after an unwanted character. See his post for details.
This version allows me to use it around html elements, etc.
<?php
function my_ucwords($string){
$invalid_characters = array('"',
'\(',
'\[',
'\/',
'<.*?>',
'<\/.*?>');
foreach($invalid_characters as $regex){
$string = preg_replace('/('.$regex.')/','$1 ',$string);
}
$string=ucwords($string);
foreach($invalid_characters as $regex){
$string = preg_replace('/('.$regex.') /','$1',$string);
}
return $string;
}
?>
Moreover, to get a proper title case, i combine it with this function:
This Function is obtained from:
http://codesnippets.joyent.com/posts/show/716
<?php
function title_case($title) {
$smallwordsarray = array(
'of','a','the','and','an','or','nor','but','is','if','then',
'else','when',
'at','from','by','on','off','for','in','out',
'over','to','into','with'
);
$words = explode(' ', $title);
foreach ($words as $key => $word)
{
if ($key == 0 or !in_array($word, $smallwordsarray))
$words[$key] = $this->my_ucwords(strtolower($word));
}
$newtitle = implode(' ', $words);
return $newtitle;
}
?>
Hope you find it useful.
gregomm at gmail dot com
23-May-2008 09:38
23-May-2008 09:38
An improved of ucsentence. In fact its a function to avoid ugly data entry. It's based on code taken from this page. Can be inproved for better performance, of course...
Features:
1- removes duplicated question marks, an exclamations
2- Capitalize first letter of a sentence.
3- split sentences not only with "." but also with "?" and "!"
4- Puts a white space at the begining of each sentence
5- undestand the meaning of "¡" and "¿" in languages like spanish.
6- undestand the htmlentitity version of this simbols.
function sentenceNormalizer( $sentence_split ) {
$sentence_split = preg_replace("/[!]+/","!",$sentence_split);
$sentence_split = preg_replace("/[¡]+/","¡",$sentence_split);
$sentence_split = preg_replace("/[?]+/","?",$sentence_split);
$sentence_split = preg_replace("/[¿]+/","¿",$sentence_split);
$textbad = preg_split("/(\<[a-zA-Z0-9-]*\>".
"\!(\s)?|\.(\s)?|\?(\s)?|¿(\s)?|¡(\s)?".
"|¿(\s)?|¡(\s)?)/",
$sentence_split,-1,PREG_SPLIT_DELIM_CAPTURE);
$newtext = array();
$count = sizeof($textbad);
$prevStr ="";
for ($i = 0; $i < $count; $i++){
$text = trim($textbad[$i]);
$size = strlen($text);
if ($size>1){
$sentencegood=ucfirst(strtolower($text));
if ($i>0 && $prevStr != '¿' && $prevStr != '¡' && $prevStr !="¿" && $prevStr !="¡"){
$sentencegood =" ".$sentencegood;
}
$newtext[] = $sentencegood;
$prevStr =$text;
}elseif($size==1 ){
if ($i>0 && ($text == '¿' || $text == '¡' || $prevStr =="¿" || $prevStr =="¡") ){
$newtext[] =" ".$text;
}else{
$newtext[] =$text;
}
$prevStr =$text;
}
}
$textgood = implode($newtext);
return $textgood;
}
caio dot ariede at gmail dot com
25-Mar-2008 08:53
25-Mar-2008 08:53
To capitalize e-mail address, I made this function.
<?php
function capitalize_email($email)
{
$domain = strtolower(strstr($email, '@'));
$user = strtolower(substr($email, 0, strrpos($email, '@')));
$user = preg_replace('!((?:\b|_).)!e', 'strtoupper("$1");', $user);
return $user.$domain;
}
echo capitalize_email('blabla.example@example.com');
// Blabla.Example@example.com
?>
NoName
12-Mar-2008 09:03
12-Mar-2008 09:03
For strings with diactrical marks (umlauts, etc.), consider mb_convert_case.
blake at goinoutwest dot com
30-Jan-2008 02:14
30-Jan-2008 02:14
Relating to the mb_ucwords() function posted by Anonymous. In order for this to actually be multi-byte compliant, you would also need to use mb_substr() and mb_strlen() instead of substr and strlen respectively.
Here it is corrected and extended even further to allow multiple word separators and a list of exceptions to correct after title casing. It's a bit tedious and inelegant, but things frequently are when dealing with human languages.
function mb_ucwords($str) {
$exceptions = array();
$exceptions['Hp'] = 'HP';
$exceptions['Ibm'] = 'IBM';
$exceptions['Gb'] = 'GB';
$exceptions['Mb'] = 'MB';
$exceptions['Cd'] = 'CD';
$exceptions['Dvd'] = 'DVD';
$exceptions['Usb'] = 'USB';
$exceptions['Mm'] = 'mm';
$exceptions['Cm'] = 'cm';
// etc.
$separator = array(" ","-","+");
$str = mb_strtolower(trim($str));
foreach($separator as $s){
$word = explode($s, $str);
$return = "";
foreach ($word as $val){
$return .= $s . mb_strtoupper($val{0}) . mb_substr($val,1,mb_strlen($val)-1);
}
$str = mb_substr($return, 1);
}
foreach($exceptions as $find=>$replace){
if (mb_strpos($return, $find) !== false){
$return = str_replace($find, $replace, $return);
}
}
return mb_substr($return, 1);
}
Anonymous
16-Dec-2007 12:09
16-Dec-2007 12:09
Function to do what ucwords is intended to do - just correctly also for international char sets:
function mb_ucwords($s)
{
$s = mb_strtolower(trim($s));
$w = explode(" ", $s);
$return = "";
foreach ($w as $val)
{
$return .= " " . mb_strtoupper($val{0}) . substr($val,1,strlen($val)-1);
}
return trim($return);
}
Building on an earlier snippet here.
kendsnyder at gmail dot com
30-Jul-2007 10:02
30-Jul-2007 10:02
Here is a function to capitalize a last name, accounting for hyphens, apostrophes, "Mc" and "Mac":
<?php
function CapitalizeLastName($name) {
$name = strtolower($name);
$name = join("'", array_map('ucwords', explode("'", $name)));
$name = join("-", array_map('ucwords', explode("-", $name)));
$name = join("Mac", array_map('ucwords', explode("Mac", $name)));
$name = join("Mc", array_map('ucwords', explode("Mc", $name)));
return $name;
}
?>
I speed tested it against functions that used preg_replace() with an "e" modifier, preg_replace_callback(), and a character-by-character parsing. Unexpectedly, this function using join(), array_map() and explode() was fastest.
emailfire at gmail dot com
24-May-2007 01:27
24-May-2007 01:27
To use ucwords with an exception:
<?php
function ucwordss($str, $exceptions) {
$out = "";
foreach (explode(" ", $str) as $word) {
$out .= (!in_array($word, $exceptions)) ? strtoupper($word{0}) . substr($word, 1) . " " : $word . " ";
}
return rtrim($out);
}
?>
For example:
<?php
$string = "my cat is going to the vet";
$ignore = array("is", "to", "the");
echo ucwordss($string, $ignore);
// My Cat is Going to the Vet
?>
Q1712 at online dot ms
05-May-2007 09:49
05-May-2007 09:49
ucwords() only excepts whitespace in front of a word, although some chars like '"' or '(' normally have no space between them and the following word:
<?php
$title = 'ELVIS "THE KING" PRESLEY - (LET ME BE YOUR) TEDDY BEAR';
echo ucwords(strtolower($title));
?>
prints: Elvis "the King" Presley - (let Me Be Your) Teddy Bear
To avoid this i use a small function adding and deleting blanks behind these chars, and using ucwords() in between:
<?php
function my_ucwords($string)
{
$noletters='"([/'; //add more if u need to
for($i=0; $i<strlen($noletters); $i++)
$string = str_replace($noletters[$i], $noletters[$i].' ', $string);
$string=ucwords($string);
for($i=0; $i<strlen($noletters); $i++)
$string = str_replace($noletters[$i].' ', $noletters[$i], $string);
return $string;
}
$title = 'ELVIS "THE KING" PRESLEY - (LET ME BE YOUR) TEDDY BEAR';
echo my_ucwords(strtolower($title));
?>
prints: Elvis "The King" Presley - (Let Me Be Your) Teddy Bear
chris at cmbuckley dot co dot uk
13-Feb-2007 06:16
13-Feb-2007 06:16
To get some sort of title case with lower-case articles, prepositions etc., try something like this (removing the carriage returns in the regular expression):
<?php
function lower_articles($str) {
return preg_replace(
"/(?<=(?<!:|’s)\W)
(A|An|And|At|For|In|Of|On|Or|The|To|With)
(?=\W)/e",
'strtolower("$1")',
$str
);
}
?>
I added the lookbehind (?<!:|’s) because I use this for film titles, where words following those terms should be capitalised (e.g. Lemony Snicket’s A Series of Unfortunate Events, American Pie Presents: The Naked Mile).
barnaby ritchley at exeye dot co dot uk
03-Jan-2007 05:53
03-Jan-2007 05:53
A very easy way to convert to title case:
function titleCase($string)
{
return ucwords(strtolower($string));
}
$myString = "SOME TEXT";
echo titleCase($myString);
//will print, "My Text"
max at phoenixweb dot it
15-Sep-2006 04:35
15-Sep-2006 04:35
I have rewritten a UCSMART function adding a feature to translate special ASCII char (windows occidental ascii charset):
You can edit/add/delete char by use the first two string (be carefull to preserve the order of the string).
Enjoy!
<?
$ASCII_SPC_MIN = "àáâãäåæçèéêëìíîïðñòóôõöùúûüýÿžš";
$ASCII_SPC_MAX = "ÀÁÂÃÄÅÆÇÈÉÊËÌÍÎÏÐÑÒÓÔÕÖÙÚÛÜÝŸŽŠ";
function str2upper($text) {
global $ASCII_SPC_MIN,$ASCII_SPC_MAX;
return strtr(strtoupper($text),$ASCII_SPC_MIN,$ASCII_SPC_MAX);
}
function str2lower($text) {
global $ASCII_SPC_MIN,$ASCII_SPC_MAX;
return strtr(strtolower($text),$ASCII_SPC_MAX,$ASCII_SPC_MIN);
}
function ucsmart($text) {
global $ASCII_SPC_MIN;
return preg_replace(
'/([^a-z'.$ASCII_SPC_MIN.']|^)([a-z'.$ASCII_SPC_MIN.'])/e',
'"$1".str2upper("$2")',
str2lower($text)
);
}
?>
Massimiliano Cuttini
marco at hotelsandgo dot com
09-Sep-2006 06:35
09-Sep-2006 06:35
ucwords that works also with apices: '
for example: "aquila d'abruzzo" became "Aquila d'Abruzzo"
The second part after the // comment can removed! Is optimized for italian language (leave lowercase articles, prepositions, conjunctions) but easily changeable. Notice che "'s"!! for example "hotel 2000's" becames "Hotel 2000's".
function my_ucwords($s)
{
$a=strtolower($s);
$s=ucfirst($a);
for($x=0; $x<strlen($s)-1; $x++)
if(!ctype_alpha($s[$x])) $s[$x+1]=strtoupper($s[$x+1]);
//Lascia minuscoli articoli, preposizioni, congiunzioni
$minuscole=array("il", "lo", "la", "i", "gli", "le", //ARTICOLI DETERMINATIVI
"un", "uno", "una", //ARTICOLI INDETERMINATIVI
"e", "d", "l", "s", "un", //CONGIUNZIONI e CONTRATTI
"di", "a", "da", "in", "con", "su", "per", "tra", "fra", //PREPOSIZIONI SEMPLICI
"del", "dello", "della", "dei", "degli", "delle", //PREPOSIZIONI ARTICOLATE
"a", "al", "allo", "alla", "ai", "agli", "alle",
"da", "dal", "dallo", "dalla", "dai", "dagli", "dalle",
"in", "nel", "nello", "nella", "nei", "negli", "nelle",
"con", "col", "collo", "colla", "coi", "cogli", "colle",
"su", "sul", "sullo", "sulla", "sui", "sugli", "sulle",
"per", "pel", "pei");
foreach($minuscole as $value)
{
$pos=strpos($a, $value);
if( ( $pos>0 && $pos<strlen($s)-1 && !ctype_alpha($a[$pos-1]) && !ctype_alpha($a[$pos+1]) ) //CARATTERE IN MEZZO
|| ( $pos==strlen($s)-1 && !ctype_alpha($a[$pos-1]) ) ) //CASO PARTICOLARE: carattere in fondo. x es: "hotel 2000's"
$s[$pos]=strtolower($s[$pos]);
}
return $s;
}
function my_ucwords_essential($s)
{
$a=strtolower($s);
$s=ucfirst($a);
for($x=0; $x<strlen($s)-1; $x++)
if(!ctype_alpha($s[$x])) $s[$x+1]=strtoupper($s[$x+1]);
return $s;
}
Can be rewritten better, I Know
philip at fcknet dot dk
08-Aug-2006 02:11
08-Aug-2006 02:11
Before noticing this function I made the following function (that does the same as ucwords):
<?php
function firstUpper($string)
{
$string = str_replace(array("Æ","Ø","Å"), array("æ","ø","å"), strtolower($string));
$ord = explode(" ", $string);
$return = "";
foreach ($ord as $val)
{
$return .= " " . str_replace(array("æ","ø","å"), array("Æ","Ø","Å"), strtoupper($val{0})) . substr($val,1,strlen($val)-1);
}
return $return;
}
?>
It also converts Danish letters without using the setlocale function.
starmonkey [at] evolove [dot] net
05-Aug-2006 05:17
05-Aug-2006 05:17
Simple helper function to walk through a nested array of strings and upper case them:
<?php
/**
* Helper function to convert an array of strings to upper case words
*/
function _capitalize($input) {
if(is_array($input)) {
// recurse through array elements (using a reference)
foreach($input as &$value) {
$value = _capitalize($value);
}
return $input;
} elseif(is_string($input)) {
// process this string
return ucwords($input);
} else {
// all other data types, leave alone
return $input;
}
}
?>
Ismet Togay
31-May-2006 02:07
31-May-2006 02:07
Response to arif:
We do not need that long functions. In order to make ucwords() worked properly in Turkish words that contain speacial characters, we can use the following command in our php codes:
setlocale(LC_ALL, 'tr_TR');
This will set locale to Turkish.
lev at phpfox dot com
06-May-2006 10:44
06-May-2006 10:44
In the function ucsmart() posted by ieure at php dot net on 04-Dec-2005 11:57, I found a similar problem in this function to what he found in igua's.
<?php
function ucsmart($text)
{
return preg_replace('/([^a-z]|^)([a-z])/e', '"$1".strtoupper("$2")',
strtolower($text));
}
?>
"igua's code adds a backslash in front of the first single quote for me. This doesn't alter the content in any way other than changing case."
Actually, it did end up changing the content for me (php 5.0.4) in the way that this function escapes a single quotation (apostrophe) in the MIDDLE of a word.
For example:
who's online?
Became:
Who\'s Online?
The fix is simple however, and merely requires fine-tuning the regular expression:
<?php
function ucsmart($text)
{
return preg_replace('/([^a-z\']|^)([a-z])/e', '"$1".strtoupper("$2")',
strtolower($text));
}
?>
(note: while previewing this note before adding it, I am noticing php's website is not correctly displaying the change I made as I wrote it. After the first a-z in the expression, the single quotation should be escaped... If it isn't you will get a parse error! And apoligies if my text here is colored as php code; not my fault!)
This will not escape a single quotation mark which occurs in the middle of a word... Though, you may find that might need to add other characters inside the regular expression if you use other special characters inside your words and if you get funky output.
It's a great expression though! Simple, yet very powerful. Kudos!
arif
29-Mar-2006 10:12
29-Mar-2006 10:12
it can be used for Turkish alphabet.
function strtoupperTR($str){
return strtr($str,
"abcçdefgğhıijklmnoöpqrsştuüvwxyz",
"ABCÇDEFGĞHIİJKLMNOÖPQRSŞTUÜVWXYZ");
}
function strtolowerTR($str){
return strtr($str,
"ABCÇDEFGĞHIİJKLMNOÖPQRSŞTUÜVWXYZ",
"abcçdefgğhıijklmnoöpqrsştuüvwxyz");
}
function ucwordsTR($str)
{
return preg_replace('/([^a-zığüşçö]|^)([a-zığüşçö])/e', '"$1".strtoupperTR("$2")',
strtolowerTR($str));
}
24-Dec-2005 04:34
"ieure at php dot net", your idea is pure poetry!
The function below will standardize the capitalization on people's names and the titles of reports and essays . You may need to adapt the lists in "$all_uppercase" and "$all_lowercase" to suit the data that you are working with.
function my_ucwords($str, $is_name=false) {
// exceptions to standard case conversion
if ($is_name) {
$all_uppercase = '';
$all_lowercase = 'De La|De Las|Der|Van De|Van Der|Vit De|Von|Or|And';
} else {
// addresses, essay titles ... and anything else
$all_uppercase = 'Po|Rr|Se|Sw|Ne|Nw';
$all_lowercase = 'A|And|As|By|In|Of|Or|To';
}
$prefixes = 'Mc';
$suffixes = "'S";
// captialize all first letters
$str = preg_replace('/\\b(\\w)/e', 'strtoupper("$1")', strtolower(trim($str)));
if ($all_uppercase) {
// capitalize acronymns and initialisms e.g. PHP
$str = preg_replace("/\\b($all_uppercase)\\b/e", 'strtoupper("$1")', $str);
}
if ($all_lowercase) {
// decapitalize short words e.g. and
if ($is_name) {
// all occurences will be changed to lowercase
$str = preg_replace("/\\b($all_lowercase)\\b/e", 'strtolower("$1")', $str);
} else {
// first and last word will not be changed to lower case (i.e. titles)
$str = preg_replace("/(?<=\\W)($all_lowercase)(?=\\W)/e", 'strtolower("$1")', $str);
}
}
if ($prefixes) {
// capitalize letter after certain name prefixes e.g 'Mc'
$str = preg_replace("/\\b($prefixes)(\\w)/e", '"$1".strtoupper("$2")', $str);
}
if ($suffixes) {
// decapitalize certain word suffixes e.g. 's
$str = preg_replace("/(\\w)($suffixes)\\b/e", '"$1".strtolower("$2")', $str);
}
return $str;
}
// A name example
print my_ucwords("MARIE-LOU VAN DER PLANCK-ST.JOHN", true);
// Output: Marie-Lou van der Planc-St.John
// A title example
print my_ucwords("to be or not to be");
// Output: "To Be or Not to Be"
ieure at php dot net
04-Dec-2005 11:57
04-Dec-2005 11:57
Whoa guys, tone things down a bit here. No need to loop and implode. This is a one-line solution:
function ucsmart($text)
{
return preg_replace('/([^a-z]|^)([a-z])/e', '"$1".strtoupper("$2")',
strtolower($text));
}
igua's code adds a backslash in front of the first single quote for me. This doesn't alter the content in any way other than changing case.
gothicbunny at hotmail dot com
09-Nov-2005 01:16
09-Nov-2005 01:16
Here is a simple, yet winded, opposite to ucwords.
<?php
/*
# lcwords v1.000
# Convert the first word character to lowercase (opposite to ucwords)
# input string
# return string
*/
function lcwords($string)
{
/* Some temporary variables */
#loop variable
$a = 0;
#store all words in this array to be imploded and returned
$string_new = array();
#create array of all words
$string_exp = explode(" ",$string);
foreach($string_exp as $astring)
{
for($a=0;$a<strlen($astring);$a++)
{
#check that the character we are at {pos $a} is a word
#i.e. if the word was !A the code would fail at !
#then loop to the next character and succeed at A
#check at character position $a
if(preg_match("'\w'",$astring[$a]))
{
$astring[$a] = strtolower($astring[$a]);
#end the loop
break;
}
}
$string_new[] = $astring;
}
#recreate the string from array components using space deliminator
return implode(" ",$string_new);
}
?>
Of course a simplier way would be to use a callback, but I like working with long code :)
21-Oct-2005 01:14
Here's a piece that allows you to use the contents of a directory.. capitalizes the words and make links.. this particular example splits file names at _ and only selects file with .htm extensions (thought you could use any extension and call it using include() or soom such)
ie my_file_name.htm will produce
<a href="my_file_name.htm">My File Name</a>
<?php
$path = "/home/path/to/your/directory";
$mydir = dir($path);
while(($file = $mydir->read()) !== false) {
if(substr($file, -4)=='.htm'){
$trans = array("_" => " ", ".htm" => ""); // creates the editing array
$newlist = strtr($file, $trans); // edits using editing array
echo "<a href=\"".$file."\">".ucwords($newlist)."</a><br>";
}
}
?>
Static Bit
18-Sep-2005 05:01
18-Sep-2005 05:01
// programming/repair -> Programming/Repair
// mcdonald o'neil -> McDonand O'Neil
// art of street -> Art of Street
function NomeProprio($nome)
{
//two space to one
$nome = str_replace(" ", " ", $nome);
$nome = str_replace(" ", " ", $nome);
$nome = str_replace(" ", " ", $nome);
$intervalo = 1;
for ($i=0; $i < strlen($nome); $i++)
{
$letra = substr($nome,$i,1);
if (((ord($letra) > 64) && (ord($letra) < 123)) || ((ord($letra) > 48) && (ord($letra) < 58)))
{
$checa_palavra = substr($nome, $i - 2, 2);
if (!strcasecmp($checa_palavra, 'Mc') || !strcasecmp($checa_palavra, "O'"))
{
$novonome .= strtoupper($letra);
}
elseif ($intervalo)
{
$novonome .= strtoupper($letra);
}
else
{
$novonome .= strtolower($letra);
}
$intervalo=0;
}
else
{
$novonome .= $letra;
$intervalo = 1;
}
}
$novonome = str_replace(" Of ", " of ", $novonome);
$novonome = str_replace(" Da ", " da ", $novonome);
$novonome = str_replace(" De ", " de ", $novonome);
$novonome = str_replace(" Do ", " do ", $novonome);
$novonome = str_replace(" E " , " e " , $novonome);
return $novonome;
}
radley25 at nospam dot spamcop dot net
05-Jul-2005 05:06
05-Jul-2005 05:06
In response to joshuamallory at yahoo dot com:
Using CSS to fix a PHP fault is not the ideal way to solve a problem. CSS is browser dependent and can only be used when the data is presented in a web page. A better fix would be something like this:
<?php
function better_ucwords($string) {
$string = ucwords($string);
$string = preg_replace('#[\\/][a-z]#e', "strtoupper('$0')", $string);
return $string;
}
?>
igua no-spam at coveruniverse dot com
08-Mar-2005 01:30
08-Mar-2005 01:30
The code posted by neil doesn't fully do what is wanted. Try adding some more question marks at the end and it will return a not wanted string.
Below code will uppercase all your words regardless of the delimiter.
<?php
$text = "What?No 'delimiters',shit \"happens\" here.this solves all problems???";
preg_match_all('/[A-Za-z]+|[^A-Za-z]+/', $text, $data);
for ($i = 0; $i < count($data[0]); $i++) {
$data[0][$i] = ucfirst($data[0][$i]);
}
$text = implode("", $data[0]);
print $text;
?>
arjini at gmail dot com
23-Jan-2005 09:20
23-Jan-2005 09:20
Not so much ucwords() related as it is capital letter related. I often use camel casing (as do wikis), I needed a reason to reverse the camel casing.
function unCamelCase($str){
$bits = preg_split('/([A-Z])/',$str,false,PREG_SPLIT_DELIM_CAPTURE);
$a = array();
array_shift($bits);
for($i = 0; $i < count($bits); ++$i)
if($i%2)
$a[] = $bits[$i - 1].$bits[$i];
return $a;
}
print_r(unCamelCase('MyFancyCamelCasedWord'));
Array
(
[0] => My
[1] => Fancy
[2] => Camel
[3] => Cased
[4] => Word
)
joshuamallory at yahoo dot com
15-Nov-2004 05:08
15-Nov-2004 05:08
If you want to format a string like...
<?php
$string = "computer programming/repair";
print ucwords($string);
?>
Output: Computer Programming/repair
Notice the word after the slash (Programming/repair) isn't capitalized. To fix this, use CSS...
<?php
$string = "computer programming/repair";
print '<p style="text-transform:capitalize">';
print ucwords($string);
print '<p>';
?>
babel - nospamplease - sympatico - ca
11-Feb-2004 05:26
11-Feb-2004 05:26
Correction to the code of firewire at itsyourdomain dot com:
preg_replace_callback('/\b(\w)(\w+)?/',
create_function('$a',
'return strtoupper($a[1]) . ((sizeof($a) > 2 ) ?
strtolower($a[2]) : "");'),
'p.s.: hello.this is my string.');
Will work with punctuation as well as spaces.
deepdene at email dot com
10-Dec-2002 08:20
10-Dec-2002 08:20
A function knowing about name case (i.e. caps on McDonald etc)
function name_case($name)
{
$newname = strtoupper($name[0]);
for ($i=1; $i < strlen($name); $i++)
{
$subed = substr($name, $i, 1);
if (((ord($subed) > 64) && (ord($subed) < 123)) ||
((ord($subed) > 48) && (ord($subed) < 58)))
{
$word_check = substr($name, $i - 2, 2);
if (!strcasecmp($word_check, 'Mc') || !strcasecmp($word_check, "O'"))
{
$newname .= strtoupper($subed);
}
else if ($break)
{
$newname .= strtoupper($subed);
}
else
{
$newname .= strtolower($subed);
}
$break=0;
}
else
{
// not a letter - a boundary
$newname .= $subed;
$break=1;
}
}
return $newname;
}
firewire at itsyourdomain dot com
20-Nov-2002 12:13
20-Nov-2002 12:13
For those that want to capitalize based on a regular expression.
print preg_replace_callback('/(\s|^)[a-z]/', create_function('$a', 'return strtoupper($a[0]);'), 'hello this is my string');
This is a quick untested example.
anton at titov dot net
25-Sep-2002 07:56
25-Sep-2002 07:56
for those, who not avoid regular expressions, solution of discussed problem:
$text=preg_replace('/(\W)(\w)/e', '"\\1".strtoupper("\\2")', ucfirst(strtolower($text)));
fille at fukt dot bth dot se
27-Aug-2002 05:04
27-Aug-2002 05:04
[Editor's note: For details on the bug see
http://bugs.php.net/bug.php?id=14655]
This function has a bug, and while waiting for the bug fix, here is a work-around pice of code.
When using international letters, you will get into troubles with the ucwords() function.
Example:
$string="xxxxx" will be "XxxXxx" after beeing processed by ucwords().
To get around it, I wrote some extra code that checks the string once more, and lowercases all letters that is not in the beginning of a word.
$string=ucwords($string);
//Bugfix from here on
for($i=0;$i<strlen($string);$i++)
if((ctype_upper($string[$i]) &&( $string[$i-1]==" " || $i==0 ))!=TRUE)
$string[$i]=strtolower($string[$i]);
Thia code is also an optional way of doing the same work on a string that is totally UPPERCASE.
27-Aug-2002 04:20
Beware of language when using this function to collate personal names! This may not work with some languages and this depends on the current locale!
So it's best to simply use strtoupper() or strtolower(strtoupper()) to collate names for searches in a database. Avoid using strtolower() directly, as it won't collate some characters like the german '' into 'ss'.
Capitalizing names is very language dependant: don't do it on address fields such as city names. Prefer uppercasing, or keep the original case if the string must be displayed to a user!
19-Jan-2002 03:14
This seems to be what people want:
function uc_all($string) {
$temp = preg_split('/(\W)/', $string, -1, PREG_SPLIT_DELIM_CAPTURE );
foreach ($temp as $key=>$word) {
$temp[$key] = ucfirst(strtolower($word));
}
return join ('', $temp);
}
[ed note: fixed the code to be correct]
Julienc at psychologie-fr dot com
04-Nov-2001 04:06
04-Nov-2001 04:06
Its still possible to clean a bit more the previous sample:
$string=strtolower($string); $break=true;
for ($i=0; $i < strlen($string); $i++) { $subed=$string[$i];
if (((ord($subed) > 64) && (ord($subed) < 123)) || ((ord($subed) > 48) && (ord($subed) < 58))) {
if ($break) { $string[$i] = strtoupper($subed); }
$break=false; } else { $break=true; }
}
- Julien
mistcat at mistcat dot com
29-Mar-2001 12:00
29-Mar-2001 12:00
Actually that code would work if you changed this line:
$words[0][] = $lastword;
to
$words[0][] = $lastword[0];
neil at no-spam-ents24 dot com
21-Mar-2001 02:10
21-Mar-2001 02:10
The code posted above by Joerg Krause only works for a string which ends with one of the delimiters. A possible fix is:
<?php
$text = "What?No delimiters,shit happens here.this solves all problems.";
preg_match_all("/(\w+[,. ?])+/U", $text, $words);
preg_match("/(\w+)$/", $text, $lastword);
$words[0][] = $lastword;
foreach($words[0] as $part) $uwords[] = ucfirst($part);
$text = implode("", $uwords);
echo $text;
?>
