I'm creating a simple wordcount program in Java that reads through a directory's text-based files.
However, I keep on getting the error:
java.nio.charset.MalformedInputException: Input length = 1
from this line of code:
BufferedReader reader = Files.newBufferedReader(file,Charset.forName("UTF-8"));
I know I probably get this because I used a Charset
that didn't include some of the characters in the text files, some of which included characters of other languages. But I want to include those characters.
I later learned at the JavaDocs that the Charset
is optional and only used for a more efficient reading of the files, so I changed the code to:
BufferedReader reader = Files.newBufferedReader(file);
But some files still throw the MalformedInputException
. I don't know why.
I was wondering if there is an all-inclusive Charset
that will allow me to read text files with many different types of characters?
Thanks.
ISO-8859-1
and it works well. I think it's for European characters, which is fine. I still don't know whyUTF-16
doesn't work, though. – Tucket