[Discuss] Now you see it now you don't
John Abreau
abreauj at gmail.com
Mon Nov 27 16:02:54 EST 2023
The command-line tool "pdftotext" will extract text from a PDF file, and
"pdfimages" will extract images from a PDF file. Both tools are in the rpm
package "poppler-utils".
If you're using debian or ubuntu, it's possible that the .deb package is
named differently, but I assume it's available on those distributions.
On Mon, Nov 27, 2023 at 2:42 PM Rich Pieri <richard.pieri at gmail.com> wrote:
> On Mon, 27 Nov 2023 09:55:04 -0800
> Kent Borg <kentborg at borg.org> wrote:
>
> > > and that any attempt to read the raw text of the email had been
> > > blocked in Thunderbird.
> > They manage to disable "View"->"Message Source Ctrl+U"? That is
> > impressive.
>
> If they buried the whole thing in the PDF file then there is no raw
> message text. And never mind that this violates all the mail handling
> standards and never mind the ADA.
>
> Anywho, it's entirely possible that there is no text at all, and the
> PDF is bitmap image(s). A simple PDF viewer like Sumatra, which doesn't
> have a JavaScript interpreter, should make this apparent, or that it's
> all embedded JavaScript.
>
> --
> \m/ (--) \m/
> _______________________________________________
> Discuss mailing list
> Discuss at lists.blu.org
> http://lists.blu.org/mailman/listinfo/discuss
>
--
John Abreau / Executive Director, Boston Linux & Unix
Email: abreauj at gmail.com / WWW http://www.abreau.net / PGP-Key-ID 0x920063C6
PGP-Key-Fingerprint A5AD 6BE1 FEFE 8E4F 5C23 C2D0 E885 E17C 9200 63C6
More information about the Discuss
mailing list