Everything you always wanted to know about UTF-8 (but never dared to ask)

June 24, 2014

Presented on June 24th 2014 at the PHP Tour, Lyon, France. http://afup.org/pages/phptourlyon2014/

http://www.joind.in/11233

For any application with even the remotest ambition of international use, the only way to go is UTF-8. And even without that ambition, using UTF-8 might still bring you more benefits than you currently realize. Unfortunately, most developers run into problems implementing UTF-8 at one point or another and get discouraged. That ends now! In this talk I will cover UTF-8 from the basic linguistics, through client-side aspects, to all the steps you need to take to tackle the most common (and some more obscure) issues when using UTF-8 in a database-driven web application.

Links:

Slide 2: http://intertwingly.net/blog/2004/04/25/utf-8-musings#c1082919794 http://intertwingly.net/blog/2004/04/25/utf-8-musings#c1082929502

Slide 5/6: http://www.ethnologue.com/

Slide 7: http://www.omniglot.com/

Slide 8: http://en.wikipedia.org/wiki/Writing_system

Slide 11: http://geek-and-poke.com/

Slide 12: http://www.unicode.org/charts/

Slide 39: http://sourceforge.net/projects/phputf8

Slide 40: http://www.phpwact.org/php/i18n/utf-8

Slide 44: http://www.php.net/regexp.reference.unicode

Slide 46: http://www.php.net/mbstring

Slide 47: http://www.php.net/iconv

Slide 48: http://www.php.net/intl

Slide 57: http://www.phpwact.org/php/i18n/utf-8/mysql

Slide 62: http://www.poedit.net/


Other interesting links:

http://www.eki.ee/letter/ http://www.bisharat.net/A12N/

http://www.wazu.jp/ http://www.alanwood.net/unicode/ http://en.wikipedia.org/wiki/Unicode_typefaces

http://www.styopkin.com/detailsfreeandeasyfonts_viewer.html http://www.heiner-eichmann.de/software/listfont/listfont.htm

http://www.w3.org/International/techniques/authoring-html http://www.sil.org/iso639-3/codes.asp http://www.iso.org/iso/countrycodes/iso3166codelists/countrynamesandcodeelements http://www.w3.org/International/articles/language-tags/

http://httpd.apache.org/docs/2.4/mod/modcharsetlite.html

http://www.collation-charts.org/ http://www.unicode.org/charts/uca/ http://dev.mysql.com/doc/refman/5.7/en/charset-collation-effect.html

http://www.w3.org/International/questions/qa-utf8-bom http://people.w3.org/rishida/utils/bomtester/ http://www.unicode.org/unicode/faq/utf_bom.html#bom1

Speaker: Juliette (jrfnl)