<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://hypertwins.org/mw/index.php?action=history&amp;feed=atom&amp;title=2024%2F05%2F13</id>
	<title>2024/05/13 - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://hypertwins.org/mw/index.php?action=history&amp;feed=atom&amp;title=2024%2F05%2F13"/>
	<link rel="alternate" type="text/html" href="https://hypertwins.org/mw/index.php?title=2024/05/13&amp;action=history"/>
	<updated>2026-05-01T02:26:40Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.0</generator>
	<entry>
		<id>https://hypertwins.org/mw/index.php?title=2024/05/13&amp;diff=25291&amp;oldid=prev</id>
		<title>Woozle at 20:56, 13 May 2024</title>
		<link rel="alternate" type="text/html" href="https://hypertwins.org/mw/index.php?title=2024/05/13&amp;diff=25291&amp;oldid=prev"/>
		<updated>2024-05-13T20:56:44Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:56, 13 May 2024&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot;&gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;{{woozle/page/journal}}&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;{{woozle/page/journal}}&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Harena]] needed to snag the text from some images, and I thought: Shirley, by now&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;, &lt;/del&gt;there must be a usable GUI OCR package in Ubuntu.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[Harena]] needed to snag the text from some images, and I thought: Shirley, by now there must be a usable GUI OCR package in Ubuntu.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;So I searched {{l/wp|APT (software)|&amp;lt;code&amp;gt;apt&amp;lt;/code&amp;gt;}} for &amp;quot;OCR&amp;quot;, and... after wading through a bunch of stuff, found a thing called YAGF which is apparently a front-end for either of two CLI apps called {{l/wp|CuneiForm (software)|Cuneiform}} and {{l/wp|Tesseract (software)|Tesseract}} (with the latter being the default, and something I had apparently installed earlier).&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;So I searched {{l/wp|APT (software)|&amp;lt;code&amp;gt;apt&amp;lt;/code&amp;gt;}} for &amp;quot;OCR&amp;quot;, and... after wading through a bunch of stuff, found a thing called YAGF which is apparently a front-end for either of two CLI apps called {{l/wp|CuneiForm (software)|Cuneiform}} and {{l/wp|Tesseract (software)|Tesseract}} (with the latter being the default, and something I had apparently installed earlier).&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key htorg?hmw:diff:1.41:old-25290:rev-25291:php=table --&gt;
&lt;/table&gt;</summary>
		<author><name>Woozle</name></author>
	</entry>
	<entry>
		<id>https://hypertwins.org/mw/index.php?title=2024/05/13&amp;diff=25290&amp;oldid=prev</id>
		<title>Woozle: Created page with &quot;{{woozle/page/journal}} Harena needed to snag the text from some images, and I thought: Shirley, by now, there must be a usable GUI OCR package in Ubuntu.  So I searched {...&quot;</title>
		<link rel="alternate" type="text/html" href="https://hypertwins.org/mw/index.php?title=2024/05/13&amp;diff=25290&amp;oldid=prev"/>
		<updated>2024-05-13T20:56:20Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;{{woozle/page/journal}} &lt;a href=&quot;/Harena&quot; class=&quot;mw-redirect&quot; title=&quot;Harena&quot;&gt;Harena&lt;/a&gt; needed to snag the text from some images, and I thought: Shirley, by now, there must be a usable GUI OCR package in Ubuntu.  So I searched {...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;{{woozle/page/journal}}&lt;br /&gt;
[[Harena]] needed to snag the text from some images, and I thought: Shirley, by now, there must be a usable GUI OCR package in Ubuntu.&lt;br /&gt;
&lt;br /&gt;
So I searched {{l/wp|APT (software)|&amp;lt;code&amp;gt;apt&amp;lt;/code&amp;gt;}} for &amp;quot;OCR&amp;quot;, and... after wading through a bunch of stuff, found a thing called YAGF which is apparently a front-end for either of two CLI apps called {{l/wp|CuneiForm (software)|Cuneiform}} and {{l/wp|Tesseract (software)|Tesseract}} (with the latter being the default, and something I had apparently installed earlier).&lt;br /&gt;
&lt;br /&gt;
After I loaded up a sample image to scan, the following sequence of events followed:&lt;br /&gt;
* It claimed I hadn&amp;#039;t installed the English language data files for Tesseract, which I had.&lt;br /&gt;
* I tried re-installing them, then closing and reopening YAGF; no joy.&lt;br /&gt;
* After much searching of the web for help, I noticed that the Settings dialog box asks for the location of the Tesseract data files, which had been set to the root folder.&lt;br /&gt;
** It seems somewhat unhelpful to complain about not being able to find files when the location for those files obviously hasn&amp;#039;t been set.&lt;br /&gt;
** I looked in &amp;lt;code&amp;gt;apt&amp;lt;/code&amp;gt; to see where the files might be, and found them in &amp;lt;code&amp;gt;/usr/share/tesseract-ocr/5/tessdata&amp;lt;/code&amp;gt;.&lt;br /&gt;
* After this, YAGF would do what &amp;#039;&amp;#039;appeared&amp;#039;&amp;#039; to be an attempt to parse the image, but it lasted less than a second and produced no output.&lt;br /&gt;
** I wasn&amp;#039;t sure if YAGF would definitely show me the output, so I did a &amp;quot;File &amp;amp;rarr; Save All Text&amp;quot;. The resulting file had zero bytes.&lt;br /&gt;
** I tried modifying the path, in case YAGF was expecting an enclosing or enclosed folder (but wasn&amp;#039;t upset to the point of giving me the error again) -- no go.&lt;br /&gt;
* Then I thought of switching to Cuneiform, in case that worked better -- which (after installing it) it did.&lt;br /&gt;
** Running a scan with Cuneiform selected but not yet installed doesn&amp;#039;t give an error message. This is also unhelpful.&lt;/div&gt;</summary>
		<author><name>Woozle</name></author>
	</entry>
</feed>