From: Theo Veenker (Theo.Veenker@let.uu.nl)
Date: Thu Aug 11 2005 - 06:31:36 CDT
Ritesh wrote:
> Hi,
>
> Thanks a lot for prompt reply.
>
> Issue is like. We have one application where user can upload a file in
> tab delimited or xls file.
>
> Now we have few user who upload a file which can contain English and
> other language characters(Here it is Arabic).
>
> This files can have different combinations as below,
> 1. File is a UTF-8 and have English and Arabic Characters.
> 2. File is a UTF-16 (LE) and have English and Arabic Characters.
> 3. File is UTF-8 and Have only Arabic Characters
> 4. File is UTF-8 and Have only English Characters
> 5. File is UTF-16 and Have only Arabic Characters
> 6. File is UTF-16 and Have only English Characters
> 7. File can be in ASCII format.
>
> Once the file is uploaded it will be displayed on the browser.
>
> Now, we are using following logic while uploading file.
[snip]
Did you check this one, it is a java port of mozilla's automatic charset
detection algorithm. The original C++ sources are provided as well.
http://www.i18nfaq.com/chardet.html
Theo
This archive was generated by hypermail 2.1.5 : Thu Aug 11 2005 - 06:33:14 CDT