Austin Ziegler <halostatue / gmail.com> writes: > On 11/29/05, Jeffrey Schwab <jeff / schwabcenter.com> wrote: >> basi wrote: >> > Looking for ideas on how to split a text file into sentences. I see the >> > problem of basing the split on [.!?] -- they're also used in ways other >> > than to end a sentence. If I have to do manual pre-processing of the >> > text file, what editing might I do? Has anyone had to deal with this >> > problem and how did you make life easier for you? >> It's a common convention to separate sentences by double spaces. I >> started following this convention because Emacs expected it, and now I >> use it always. > > As I noted above, this is an improper convention outside of the > typewriter realm. If you are using anything other than a fixed-pitch > font for display or print, you should *never* use two spaces. Alternatively, use text processing systems that do the "right thing"; i.e. transform two spaces into one (e.g. TeX, HTML-based products). There is no good reason a text processor should show two spaces after each other in print. > -austin -- Christian Neukirchen <chneukirchen / gmail.com> http://chneukirchen.org