Skip to main content

Can"t read Arabic text file in Java



i am trying to read arabic text using Java , yet the scanner does not see any elements and thus reading is unsuccessful although LineNumberReader recognizes lines in the text file.





i have tried the same code on English text and it works fine.





i am using netbeans 7.0.1





here is my code :







public class ReadFile {

private int number_of_words;

private File f1;

private String array[][],lines[];

private Scanner scan1;



public ReadFile(String sf1) throws FileNotFoundException

{

f1=new File(sf1);

scan1=new Scanner(f1);



}



public String[][] getA()

{

return array;

}



public void read() throws IOException

{

int counter=0,i=0;



LineNumberReader lnr = new LineNumberReader(new FileReader(f1));

lnr.skip(Long.MAX_VALUE);

number_of_words=lnr.getLineNumber();

array = new String[2][number_of_words];

lines = new String[number_of_words];

while(scan1.hasNext())

{

String temp;

temp=scan1.nextLine();

lines[counter++] = temp;

System.out.println(lines[counter-1]+"\t"+lines.length);



}



Arrays.sort(lines);

counter=0;



while(i<lines.length)

{

String temp = lines[i++];

StringTokenizer tk=new StringTokenizer(temp,"\t");



array[0][counter] = tk.nextToken();

array[1][counter++] = tk.nextToken();

}

}

}




Comments

  1. By default scanner uses system encoding. You need to use correct character encoding while reading data special characters.

    scan1=new Scanner(f1, "UTF-8");


    If UTF-8 didn't work you need to try with arabic specific encoding.

    Here are couple of links may be useful File reading practices and Java supported encodings

    ReplyDelete
  2. Try reading the file with this:

    FileInputStream fis = new FileInputStream(f1);
    LineNumberReader lnr = new LineNumberReader(new InputStreamReader(fis, "UTF-8"));


    You need to use the right Charset when reading the file.

    ReplyDelete
  3. Scanner(System.in, "UTF-8")


    is most probably what you are looking for.

    Cheers, Eugene.

    ReplyDelete

Post a Comment

Popular posts from this blog

Slow Android emulator

I have a 2.67 GHz Celeron processor, 1.21 GB of RAM on a x86 Windows XP Professional machine. My understanding is that the Android emulator should start fairly quickly on such a machine, but for me it does not. I have followed all instructions in setting up the IDE, SDKs, JDKs and such and have had some success in staring the emulator quickly but is very particulary. How can I, if possible, fix this problem?

CCNA 1 Final Exam 2011 latest (hot hot hot)

  Hi! I have been posted content of ccna1 final exam (latest and only question.) I will post the answer and insert image on sunday. If you care, please subscribe your email an become a first person have full test content. Subcribe now  Some question  have not content because this question have images content. So that can you wait for me? SUNDAY 1. A user sees the command prompt: Router(config-if)# . What task can be performed at this mode? Reload the device. Perform basic tests. Configure individual interfaces. Configure individual terminal lines. 2. Refer to the exhibit. Host A attempts to establish a TCP/IP session with host C. During this attempt, a frame was captured with the source MAC address 0050.7320.D632 and the destination MAC address 0030.8517.44C4. The packet inside the captured frame has an IP source address 192.168.7.5, and the destination IP address is 192.168.219.24. At which point in the network was this packet captured? leaving host A leaving ATL leaving...