Good Word doc -> plain text conversion
John Abreau
abreauj-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org
Mon Sep 20 09:40:26 EDT 2010
Did you try the -w option? From the man page:
-w width
In text mode this is the line width in characters. A value
of
zero puts an entire paragraph on a line, useful when the text
is
to used as input for another wordprocessor. This value
is
ignored in PostScript mode.
On Mon, Sep 20, 2010 at 9:30 AM, Ian Stokes-Rees <
ijstokes-/2FeUQLD3jedFdvTe/nMLpVzexx5G7lz at public.gmane.org> wrote:
>
>
> On 9/20/10 12:01 AM, jc-8FIgwK2HfyJMuWfdjsoA/w at public.gmane.org wrote:
> > Dan Ritter wrote:
> > | antiword is the usual candidate. Every one of Google's first ten
> > | results for that are relevant.
> >
> > Yeah, I thought of that, too, but I was hoping there might be something
> that
> > does a better job. In one of my current sample .doc files, for
> example,
> > antiword produces the curious table entry:
>
>
> Use antiword and recompile it yourself with no line length limit. I'm
> sure you'll easily find some hard-coded value of 138 in there.
>
> "antiword" is the standard and will, I suspect, support more than
> anything else you find.
>
> Ian
> _______________________________________________
> Discuss mailing list
> Discuss-mNDKBlG2WHs at public.gmane.org
> http://lists.blu.org/mailman/listinfo/discuss
>
--
John Abreau / Executive Director, Boston Linux & Unix
GnuPG KeyID: 0xD5C7B5D9 / Email: abreauj-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org
GnuPG FP: 72 FB 39 4F 3C 3B D6 5B E0 C8 5A 6E F1 2C BE 99
More information about the Discuss
mailing list