Preserving Data Books From Yesteryear
12/9/2013

Some data book covers are iconic, like the blue and white from National.

rich.pell
User Rank
Blogger
Re: Lp's And torrent
rich.pell   12/17/2013 9:34:14 PM
"It didn't come with any software (because of copyright issues) but pointed me at public domain software (I don't remember the name offhand). That worked well."

Quite possibly it was the excellent Audacity audio editor.

antedeluvian
User Rank
Blogger
Re: Bandsaw and DjVu
antedeluvian   12/17/2013 9:20:23 PM
rj2k000

You could get the spine cut off at a book binder place.

Interesting point. The problem would have been that I would have had to take hundreds of books to the bookbinder place and then bring them back in pieces, to say nothing of the cost. Still, it's an idea to keep in mind.

antedeluvian
User Rank
Blogger
Re: Lp's And torrent
antedeluvian   12/17/2013 9:17:21 PM
Cesar

I just bought one of those turntables with a USB port. It didn't come with any software (because of copyright issues) but pointed me at public domain software (I don't remember the name offhand). That worked well. I then recorded every LP and, using the software, attempted to smooth out some of the pops and crackle, sometimes with more success than others.

It was quite an experience listening to every cut of music that I had, and reflecting on quite how bad some of it was. It also brought back memories, and there were even cuts I don't remember ever hearing before.

Cesar_Rodriguez
User Rank
Rookie
Lp's And torrent
Cesar_Rodriguez   12/17/2013 6:17:35 PM
"This was after I had converted all my 60s/70s music on LPs to MP3 files, but that is a story for another day." I hope that day comes soon. I am planning to do the same with my father's collection, so it would be nice to have some ideas.

Have you considered putting up a torrent of this collection? With the size of hard drives and the speed of the internet these days, some people would probably like to share it.

I am glad that some people take the time and effort to preserve this kind of info, thank you.

rj2k000
User Rank
Rookie
Re: Bandsaw and DjVu
rj2k000   12/16/2013 7:42:35 PM
You could get the spine cut off at a book binder place. When I had my thesis bound, they first glued on the spine, then used a saw to align all the page edges perfectly. The saw was purpose-made for books, so no ragged edges were left.

Wnderer
User Rank
CEO
Re: The Internet Archive
Wnderer   12/13/2013 5:20:34 PM
@antedeluvian

Glad to help. I look forward to browsing through your collection.

antedeluvian
User Rank
Blogger
Re: The Internet Archive
antedeluvian   12/13/2013 4:20:37 PM
Wnderer

You should check out the Internet Archive.

I did, and they will be happy to take the data. I will send it to them in the new year. Thanks for the link.

bpaddock
User Rank
Rookie
Re: Bandsaw and DjVu
bpaddock   12/13/2013 12:16:22 PM
DjVu size savings over PDF, using http://any2djvu.djvu.org on two test PDF files Aubrey supplied:

Converting 5M file to 1.5M file at 300 DPI output resolution:
Original file 5,357,098 bytes; DjVu file 1,582,406 bytes

Converting same 5M file to 4M file at 600 DPI output resolution:
Original file 5,357,098 bytes; DjVu file 3,999,325 bytes

Converting 15M file to 3M file at 300 DPI output resolution:
Original file 16,024,540 bytes; DjVu file 3,242,739 bytes

Converting same 15M file to 7.5M file at 600 DPI output resolution:
Original file 16,024,540 bytes; DjVu file 7,688,352 bytes

There are also PDF-to-DjVu conversion programs for several operating systems.
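
For anyone who wants the figures above as percentages, here is a minimal Python sketch that only does the arithmetic on the byte counts quoted in this comment. The names RESULTS and saving_percent are made up for illustration; nothing here calls a DjVu tool.

```python
# Byte counts quoted in the comment above:
# (original PDF size, DjVu output size, DjVu output resolution in DPI).
RESULTS = [
    (5_357_098, 1_582_406, 300),
    (5_357_098, 3_999_325, 600),
    (16_024_540, 3_242_739, 300),
    (16_024_540, 7_688_352, 600),
]


def saving_percent(original_bytes, converted_bytes):
    """Return the size reduction as a percentage of the original size."""
    return 100.0 * (original_bytes - converted_bytes) / original_bytes


for pdf_bytes, djvu_bytes, dpi in RESULTS:
    saving = saving_percent(pdf_bytes, djvu_bytes)
    print(f"{pdf_bytes:>12,} -> {djvu_bytes:>11,} bytes at {dpi} DPI: "
          f"{saving:.1f}% smaller")
```

On these four data points the saving works out to roughly 70% and 80% at 300 DPI, and roughly 25% and 52% at 600 DPI. Two test files is a small sample, so treat the numbers as an indication only.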

antedeluvian
User Rank
Blogger
Re: I'm glad I'm not alone!
antedeluvian   12/13/2013 11:44:57 AM
Daryl

Somehow I think we should consider merging all the data, even if it is repeated.

I had a lot of text bleed-through from the other side of the page which confused the OCR. Did you have the same issue? Were you able to resolve it or live with it like me?

There was some bleed-through, but it never seemed to confuse the OCR.

DARYL.WHITE
User Rank
Rookie
I'm glad I'm not alone!
DARYL.WHITE   12/13/2013 11:39:43 AM
At least now I know that I'm not alone in my quest. Like you, I couldn't bear to just throw my library away because I know that as soon as I did, I would need some tidbit of information from the books.

My method started out the same as yours: using a box cutter to cut away the covers and cut the books into sections. That wasn't very time-efficient, especially when the family chuckled at my quest. I finally purchased a simple low-cost paper cutter. I was able to trim all of the binding glue off the pages and get a clean, consistent margin for the entire book while still having most of the margin left. I still had to break the books into small stacks of 10-15 sheets, but the margins were all consistent and the pages wouldn't jam in the scanner.

I've used both PaperPort and Adobe Acrobat for OCR. If the pages were all straight, PaperPort would produce a better "text" PDF: it would treat the scan as more text than image, which resulted in a smaller PDF. If the pages needed straightening, I always used Adobe, which had a much better algorithm. Like you, I found that scanning both sides and then merging was a much better approach, especially when pages were missed in the scan. I had the liberty of using the iText library to craft a very simple program to stitch/reverse/combine the PDF files based on the scan particulars.

When I scanned data books, I had a lot of text bleed-through from the other side of the page, which confused the OCR. Did you have the same issue? Were you able to resolve it, or did you live with it like me?
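
The stitch/reverse/combine step mentioned above comes down to page ordering. The original program used iText and is not shown here, so what follows is only a Python sketch of the ordering logic under one assumption: the fronts are scanned in reading order, then the stack is flipped and the backs come out last page first. The function name interleave_duplex and its parameters are hypothetical, and plain lists stand in for PDF pages.

```python
def interleave_duplex(fronts, backs, backs_reversed=True):
    """Merge two single-sided scans into reading order.

    fronts: front-side pages in reading order (pages 1, 3, 5, ...).
    backs:  back-side pages, last sheet first if backs_reversed is True
            (which is what flipping the whole stack over produces).
    """
    if len(fronts) != len(backs):
        # A mismatch usually means the scanner skipped or double-fed a sheet.
        raise ValueError(
            f"{len(fronts)} front pages but {len(backs)} back pages"
        )
    if backs_reversed:
        backs = list(reversed(backs))
    merged = []
    for front, back in zip(fronts, backs):
        merged.extend((front, back))
    return merged


# Three sheets: fronts scanned as 1, 3, 5; flipped stack scanned as 6, 4, 2.
pages = interleave_duplex([1, 3, 5], [6, 4, 2])
print(pages)
```

The length check is there because a missed page in either pass would otherwise shift every later page silently. With a real PDF library the list items would be page objects instead of integers.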
