AigaionWikiMain Page | About | Help | FAQ | Special pages | Log in

Printable version | Disclaimers | Privacy policy

UsingAigaion/Bibtex2Utf8

From AigaionWiki

< UsingAigaion

We have many users who work with BiBTeX for handling their bibliographic references in papers. We also have many users who do not use BiBTeX for their papers. Furthermore, a lot of bibliographic data found on the web is in BiBTeX format - even users who do not use BiBTeX for their papers may want to import these entries.

An important issue with BiBTeX is that most BiBTeX data out there does not contain UTF8 characters such as é, but instead use LaTeX codes for such special characters (in this case {\'{e}}).

But of course we would not want these codes to end up in our database -- they look ugly, and also would annoy all non-BiBTeX users because they cannot use such data in their own editor.

Aigaion contains an extensive library for converting up and down between UTF8 encoding and BiBTeX encoding (for example, between Iñtërnâtiônàlizætiøn and I{\~{n}}t{\"{e}}rn{\^{a}}ti{\^{o}}n{\`{a}}liz{\ae}ti{\o}n).

Contents

How to use these conversions

The bibtex2utf8 conversions occur whenever you input an author name or new publication data, through a form or through bibtex import: BiBTeX codes for special characters are converted to their utf8 equivalent and stored as such in the database.

Upon export of any data to BiBTeX, the conversions are applied in the reverse direction: all supported characters are converted from utf8 to their correct BiBTeX codes.

If you export to a non-BiBTeX format, the UTF8 characters simply remain as they are.

One can input BiBTeX codes for special characters => The BiBTeX codes will be converted to UTF8 for storage => For BiBTeX export, the UTF8 characters will be converted to BiBTeX codes again

Wrong conversions in the latest release

Here you can add a list of character conversions that are included in Aigaion, but are wrong. For missing conversions, see below.

Suggestions for extension with new characters

Here you can add a list of character conversions that are missing in the latest release of Aigaion.

Unit testing

The controller "<aigaionroot>/index.php/test" runs a number of unit tests, among which those for the conversions from bibtex to utf8. Appending "/verbose" to the url will give more output. As you can see, there are a few conversions for which the unit tests are in place, but that are not yet supported in the library.

Acknowledgements

This module owes a lot to the ceaseless testing of Peter Mosses, who stayed optimistics throughout all the conversion bugs and inconsistencies he found.

Retrieved from "http://wiki.aigaion.nl/index.php/UsingAigaion/Bibtex2Utf8"

This page has been accessed 1,238 times. This page was last modified 14:52, 1 February 2009.


Browse
Features

Using Aigaion
Modifying Aigaion
Translating Aigaion
Integrating Aigaion

Install

About
FAQ

Donate

This page
View source

Printable version

Page history
What links here
Related changes

My pages
Log in / create account

Special pages
New pages
Recent changes
File list
Statistics
More...
Find