Skip to main content

How to index file names (and other file metadata) in nutch?



It seems like nutch indexes only (some) parse results. It runs the indexing filters which detremine what is indexed.





These indexing filters get a Parse result as a parameter.





How can I achieve file names and other file metadata like owner being indexed?





Of course I need to add an indexing filter, but to do I also have to add a parser for parsing all filetypes and getting their metadata?


Comments

Popular posts from this blog

Slow Android emulator

I have a 2.67 GHz Celeron processor, 1.21 GB of RAM on a x86 Windows XP Professional machine. My understanding is that the Android emulator should start fairly quickly on such a machine, but for me it does not. I have followed all instructions in setting up the IDE, SDKs, JDKs and such and have had some success in staring the emulator quickly but is very particulary. How can I, if possible, fix this problem?

Java Urban Myths

Along the line of C++ Urban Myths and Perl Myths : What are the Java Urban Myths? That is, the ideas and conceptions about Java that are common but have no actual roots in reality . As a Java programmer, what ideas held by your fellow Java programmers have you had to disprove so often that you've come to believe they all learned at the feet of the same drunk old story-teller? Ideally, you would express these myths in a single sentence, and include an explanation of why they are false.