Is there way to check charset encoding of .txt file with Java?
Asked Answered
Z

2

7

Is there way to check is text file (.txt) encoded with Unicode or UTF-8 with Java?

Zebe answered 13/6, 2012 at 20:4 Comment(0)
W
11

You cannot know with absolute certainty which charset is used in the general case. I found this to be a good read.

http://illegalargumentexception.blogspot.co.uk/2009/05/java-rough-guide-to-character-encoding.html

Especially the section Automatic detection of encoding.

Withindoors answered 13/6, 2012 at 20:15 Comment(0)
B
2

Uhm, theoretically, how would you know if it is unicode?

This is the real question. Truthfully, you cannot know, but you can make a decent guess.

See: Java : How to determine the correct charset encoding of a stream for more details. :)

Broeker answered 13/6, 2012 at 20:11 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.