[DeTomaso] any reason to keep old monthly newsletters and Profiles?
Julian Kift
julian_kift at hotmail.com
Mon Oct 8 15:38:30 EDT 2018
I wrote virtually the same thing to Terry earlier and forgot to copy the list:
Agreed, but making PDFs fully searchable is not quite as straightforward, or it would have been done. I'm not sure whether all the archives are Word documents converted to PDF or scanned copies of originals. The latter makes it a step harder again, because you need OCR to turn the page images into text before anything can be searched.
The first step would probably be a fully searchable index that at least identifies where the relevant articles are located. I'm sure that part is not rocket science, but either way Terry is your man!
I just checked and it looks like everything pre-2013 is scanned in, and everything later is converted/saved from Word documents.
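For what it's worth, here is a minimal sketch in Python of the kind of index I mean. It assumes the text of each issue has already been extracted (directly from the Word-based PDFs, or via OCR for the scanned ones), which is the hard part and is not shown. The issue names and contents below are made-up placeholders, and the function names are just my own, not anything Terry or the club actually uses.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lower-case the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(documents):
    """Map each word to the set of issue names that contain it."""
    index = defaultdict(set)
    for name, text in documents.items():
        for word in tokenize(text):
            index[word].add(name)
    return index


def search(index, query):
    """Return the issues that contain every word in the query, sorted by name."""
    words = tokenize(query)
    if not words:
        return []
    hits = set.intersection(*(index.get(word, set()) for word in words))
    return sorted(hits)


# Hypothetical sample data standing in for already-extracted newsletter text.
newsletters = {
    "1998-03 newsletter": "Pantera cooling system upgrade and radiator notes",
    "2005-11 newsletter": "Mangusta restoration, part two",
    "2014-06 newsletter": "Pantera ZF transaxle rebuild and cooling tips",
}

index = build_index(newsletters)
print(search(index, "pantera cooling"))
```

This only tells you which issue to open, not where in the issue the article sits, but that is about what a first-pass index needs to do.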
Julian
________________________________
From: DeTomaso <detomaso-bounces at server.detomasolist.com> on behalf of Himes, Terry (397C) via DeTomaso <detomaso at server.detomasolist.com>
Sent: Monday, October 8, 2018 12:32 PM
To: Michael Cox; detomaso at server.detomasolist.com
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Eeeech. I hope not. But, yeah, that would make double-trouble. I gotta get my hands on them
to see fer sur.
"A Purple Heart proves you were smart enough to hatch a plan,
stupid enough to try it and lucky enough to survive!"
Terry W. Himes
JPL Jet Propulsion Laboratory
Dawn Spacecraft Team
Juno Systems & Software Team
TGO Sequence Lead
Phone: (818) 393-6261
Cell: (818) 653-8213
thimes at jpl.nasa.gov
🇺🇸
From: Michael Cox <coxmichaelt at gmail.com>
Date: Monday, October 8, 2018 at 12:20 PM
To: Terry Himes <Terry.Himes at jpl.nasa.gov>, "detomaso at server.detomasolist.com" <detomaso at server.detomasolist.com>
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Terry volunteered:
> I probably can do that. I do it every day for Terabytes of telemetry data.
> Right now I am converting 6TB of binary (channelized engineering telemetry) into
> readable and searchable text data. The 6TB will balloon to over 60TB.
>
> If your data is text, or has any metadata, it would be much easier. MySQL would
> be easy, but either ElasticSearch or DynamoDB (nosql) would be better, maybe.
>
> Of course, I've offered to help before... only Asa Jay has taken me up on it.
>
> Terry
Since the docs are scanned I would bet they are images compiled into a PDF. You'd
have to crank up your fancy OCR software first. :^)
--michael cox