Upcoming Games

No games to display

Full list
Add a game

Forum

Warrington Overlap Today at 13:53 - Soton_Speed
Notification of support ending f... Today at 12:53 - njimiller
TC Endings & bi-directional sign... Yesterday at 21:04 - TUT
Introductions thread - please sa... Yesterday at 11:23 - TraineeSignaller
At what point does it become too... 03/05/2024 at 15:28 - Giantray
Bristol MAS 1970, Stage 3D 02/05/2024 at 17:44 - pedroathome
Wrong route set in timetable "Ru... 02/05/2024 at 13:17 - Hoggorm
Carlisle 1979-1980 Timetable 02/05/2024 at 00:18 - AndyG
Cornwall Pathing Bug 01/05/2024 at 20:06 - Smudgykins
HU1402 and Runround Movements in... 01/05/2024 at 18:20 - GeoffM
Phoning North Sidings question 01/05/2024 at 09:25 - Essexgirl
recursive bug report 26/04/2024 at 11:18 - bugsy
Mockups 25/04/2024 at 17:11 - madaboutrains
York South Outer 25/04/2024 at 11:49 - HST125Scorton
Is it possible to delete trains? 24/04/2024 at 21:45 - BJTaylor

Index
Latest posts

User

Log in
Register
What's my IP?
Search

Upcoming Events

No events to display

Who's Online

postal, BaronVonSmart, TUT, Banners88, geswedey, Oddjob, NeoJade, Soton_Speed, romain, Dionysusnu, Al McLean, Meld, HST125Scorton, KymriskaDraken, simonstops, TS_trainspotter, Fishkung, tjtbcork, 0D07, Razzabazza123, njimiller (21 users seen recently)

OCR of tables (e.g. WTTs)

You are here: Home > Forum > Miscellaneous > Open mic (non-railway) > OCR of tables (e.g. WTTs)

Page 1 of 1

1

Swipe the screen to the left to view more details

OCR of tables (e.g. WTTs) 12/01/2023 at 14:18 #150131
DonRiver 151 posts	Was wondering if anyone's had a go at using OCR to parse scanned timetables, e.g. those in Network Rail's archive? Just looking at Tesseract OCR's documentation (tesseract-ocr.github.io) - it's designed for reading paragraphs of text, not tables - wondering if there's off-the-shelf image processing techniques for recognising each column by its borders, cropping it out of the image, and OCR'ing it in isolation… it _might_ not actually be difficult in Python (named for the one in Tasmania, not in Russia) Log in to reply

OCR of tables (e.g. WTTs) 12/01/2023 at 16:08 #150132
bill_gensheet 1318 posts	No, but just tried to see how it would go: https://www.onlineocr.net/pdftoexcel Seemed quite good except for dealing with times ending ½ which went to % or 1/2. While fixing the % is easy, 11/221/2 is more complicated to get to 11/22 ½ However that was a 2015 file, which looked like it was printed to pdf rather than scanned. Log in to reply The following user said thank you: DonRiver

1