Michael Sullivan <unixwzrd / mac.com> writes:

> I was wondering how search engines like Google and Yahoo parse
> Japanese text, much less web pages.  There are numerous filters to
> extract text from web pages, but parsing Japanese text is another
> matter altogether.

I'm not sure what you mean by "parsing", but if you mean segmentation
and morphological analysis of Japanese, then two popular packages for
doing this are ChaSen and MeCab.

Best,

John