[DeTomaso] any reason to keep old monthly newsletters and Profiles?
Julian Kift
julian_kift at hotmail.com
Tue Oct 9 13:25:24 EDT 2018
Well, they go back to 1972 and it's a monthly newsletter (not counting the Profiles here), so at a guess roughly 450, as I think some issues are missing.
Julian
________________________________
From: Himes, Terry (397C) <Terry.Himes at jpl.nasa.gov>
Sent: Tuesday, October 9, 2018 10:21 AM
To: Jeff Detrich; julian_kift at hotmail.com
Cc: coxmichaelt at gmail.com; detomaso at server.detomasolist.com
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Ok. How many scanned PDFs (pre-2013) are there??
Can I get a sample set to review and/or test??
Terry
"A Purple Heart proves you were smart enough to hatch a plan,
stupid enough to try it and lucky enough to survive!"
Terry W. Himes
JPL Jet Propulsion Laboratory
Dawn Spacecraft Team
Juno Systems & Software Team
TGO Sequence Lead
Phone: (818) 393-6261
Cell: (818) 653-8213
thimes at jpl.nasa.gov
🇺🇸
From: Jeff Detrich <jjdetrich at gmail.com>
Date: Tuesday, October 9, 2018 at 10:15 AM
To: "julian_kift at hotmail.com" <julian_kift at hotmail.com>
Cc: "coxmichaelt at gmail.com" <coxmichaelt at gmail.com>, Terry Himes <Terry.Himes at jpl.nasa.gov>, "detomaso at server.detomasolist.com" <detomaso at server.detomasolist.com>
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Originally, we were going to use tags, i.e. metadata, to identify what was in an article. Those tags would then be used to search the articles. Tags actually work better than general Google-type searches. When you Google, any article containing a matching word is listed as a match, so if you were looking for manifold info, almost every article would show up.
To use tags, you might need 20 or more per article to describe what it was about. They could range from the general functional area down to specifics like the brake pads or motor oil used, and would also include the year and model of the car. Attached is a doc giving ideas. Getting to the articles would use a Boolean search (commas separating the keywords), and you'd get back a list of the articles with matches.
The problem with this is that someone would have to go through each of the articles and enter the tags.
Jeff
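For anyone who wants to see how small the lookup side of the tag idea is, here is a minimal sketch in Python. It assumes the comma-separated keywords are combined with AND (the message does not say whether AND or OR was intended), and the issue names and tags are made up for illustration. Entering the tags by hand remains the real work.

```python
# Hypothetical tag index: article name -> set of lowercase tags.
# The issue names and tags below are invented for illustration only.
ARTICLE_TAGS = {
    "1985-03 newsletter": {"pantera", "1972", "brakes", "brake pads"},
    "1991-07 newsletter": {"pantera", "1974", "engine", "intake manifold"},
    "2004-11 newsletter": {"mangusta", "engine", "motor oil"},
}


def parse_query(query):
    """Split a comma-separated query into a set of normalized tags."""
    return {part.strip().lower() for part in query.split(",") if part.strip()}


def search(query, index=ARTICLE_TAGS):
    """Return the articles that carry every tag in the query (assumed AND)."""
    wanted = parse_query(query)
    if not wanted:
        return []
    # An article matches when the wanted tags are a subset of its tags.
    return sorted(name for name, tags in index.items() if wanted <= tags)


if __name__ == "__main__":
    print(search("Pantera, engine"))
```

Because a tag only matches when someone deliberately attached it, a search for "intake manifold" returns the one article that is about manifolds rather than every article that happens to mention the word.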
On Mon, Oct 8, 2018 at 2:38 PM Julian Kift <julian_kift at hotmail.com> wrote:
I wrote virtually the same thing to Terry earlier and forgot to copy the list:
Agreed, but making PDFs fully searchable is not quite as straightforward, or it would have been done. I'm not sure whether all the archives are Word documents converted to PDF or scanned copies of originals. The latter makes it a step harder again, since you need OCR to search the page images.
The first step would probably be a fully searchable index that at least identifies where the relevant articles are located. I'm sure that part is not rocket science, but either way Terry is your man!
I just checked and it looks like everything pre-2013 is scanned in; everything later is converted/saved from Word documents.
Julian
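As a rough illustration of the "searchable index" step, here is a minimal sketch of an inverted index in Python. It assumes the text of each page has already been extracted (by OCR for the scanned issues, or directly from the Word-derived PDFs); the extraction itself is not shown, and the issue labels and page text are hypothetical.

```python
import re
from collections import defaultdict

# Hypothetical input: (issue, page) -> text already extracted from the PDF.
PAGES = {
    ("1978-05", 3): "Replacing the intake manifold gasket on a Pantera.",
    ("1978-05", 4): "Brake pads: what members are running this season.",
    ("2014-02", 1): "Manifold options and carburetor tuning notes.",
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map each word to the set of (issue, page) locations containing it."""
    index = defaultdict(set)
    for location, text in pages.items():
        for word in tokenize(text):
            index[word].add(location)
    return index


def lookup(index, phrase):
    """Return the sorted locations that contain every word of the phrase."""
    words = tokenize(phrase)
    if not words:
        return []
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return sorted(hits)


if __name__ == "__main__":
    index = build_index(PAGES)
    print(lookup(index, "manifold"))
```

An index like this only says which issue and page to open; it matches any occurrence of a word, so it has the same "almost every article matches" weakness as a plain text search.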
__________________________________________________________________
From: DeTomaso <detomaso-bounces at server.detomasolist.com> on behalf of Himes, Terry (397C) via DeTomaso <detomaso at server.detomasolist.com>
Sent: Monday, October 8, 2018 12:32 PM
To: Michael Cox; detomaso at server.detomasolist.com
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Eeeech. I hope not. But, yeah, that would make double-trouble. I gotta get my hands on them to see fer sur.
From: Michael Cox <coxmichaelt at gmail.com>
Date: Monday, October 8, 2018 at 12:20 PM
To: Terry Himes <Terry.Himes at jpl.nasa.gov>, "detomaso at server.detomasolist.com" <detomaso at server.detomasolist.com>
Subject: Re: [DeTomaso] any reason to keep old monthly newsletters and Profiles?
Terry volunteered:
> I probably can do that. I do it every day for Terabytes of telemetry data.
> Right now I am converting 6TB of binary (channelized engineering telemetry) into
> readable and searchable text data. The 6TB will balloon to over 60TB.
>
> If your data is text, or has any metadata, it would be much easier. MySQL would
> be easy, but either ElasticSearch or DynamoDB (nosql) would be better, maybe.
>
> Of course, I've offered to help before... only Asa Jay has taken me up on it.
>
> Terry
Since the docs are scanned I would bet they are images compiled into a PDF. You'd have to crank up your fancy OCR software first. :^)
--michael cox
_______________________________________________
Detomaso Email List is not managed by POCA
Posted emails must not exceed 1.5 Megabytes
DeTomaso mailing list
DeTomaso at server.detomasolist.com
http://server.detomasolist.com/mailman/listinfo/detomaso
To manage your subscription (change email address, unsubscribe, etc.) use the links above.
Members who post to this list grant license to the list to forward any message posted here to all past, current, or future members of the list. They also grant the list owner permission to maintain an archive or approve the archiving of list messages.