On 11/29/05, Jeffrey Schwab <jeff / schwabcenter.com> wrote: > basi wrote: > > Looking for ideas on how to split a text file into sentences. I see the > > problem of basing the split on [.!?] -- they're also used in ways other > > than to end a sentence. If I have to do manual pre-processing of the > > text file, what editing might I do? Has anyone had to deal with this > > problem and how did you make life easier for you? > It's a common convention to separate sentences by double spaces. I > started following this convention because Emacs expected it, and now I > use it always. As I noted above, this is an improper convention outside of the typewriter realm. If you are using anything other than a fixed-pitch font for display or print, you should *never* use two spaces. -austin -- Austin Ziegler * halostatue / gmail.com * Alternate: austin / halostatue.ca