<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>BasicCAT — Computer-Aided Translation (CAT) Tools</title>
    <description>BasicCAT Website</description>
    <link>https://www.basiccat.org/</link>
    <atom:link href="https://www.basiccat.org/feed.xml" rel="self" type="application/rss+xml"/>
    <pubDate>Sat, 16 May 2026 13:04:38 +0000</pubDate>
    <lastBuildDate>Sat, 16 May 2026 13:04:38 +0000</lastBuildDate>
    <generator>Jekyll v3.10.0</generator>
    
      <item>
        <title>Browser Extension for Manga Translation</title>
        <description>&lt;p&gt;There are many manga that never get translated. When we browse manga websites through a browser, we wish to see translated versions of manga images directly, but browsers don’t have this feature built in. That’s where browser extensions (add-ons) come in.&lt;/p&gt;

&lt;p&gt;The computer‑aided image translation software &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; provides such a browser extension, supporting Chrome, Edge, Firefox, and other browsers.&lt;/p&gt;

&lt;p&gt;Manga translation typically involves the following steps:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;OCR text recognition&lt;/li&gt;
  &lt;li&gt;Layout analysis – merging text based on semantics and adjusting text order&lt;/li&gt;
  &lt;li&gt;Translation (using large language models such as ChatGPT, Deepseek, etc.)&lt;/li&gt;
  &lt;li&gt;Text removal (inpainting)&lt;/li&gt;
  &lt;li&gt;Back‑filling the translated text (lettering)&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Depending on the language and purpose, we can use different software combinations and translation strategies.&lt;/p&gt;

&lt;p&gt;For example, the text removal step can be time‑consuming if processed with AI. If our goal is merely to understand the meaning, we can use a white background fill instead. The same applies to OCR and translation.&lt;/p&gt;

&lt;p&gt;For OCR, if the language is a mainstream one like Chinese or English, local OCR such as PaddleOCR works quickly with good recognition accuracy. For vertical Japanese, traditional Chinese, etc., higher‑accuracy models are usually needed – for instance, mangaOCR for Japanese.&lt;/p&gt;

&lt;p&gt;For translation, ordinary machine translation often performs well for languages with strong logical structures like English. For high‑context languages like Japanese, especially when OCR may contain errors, a high‑quality large language model is required – sometimes even feeding the image to the model for better semantic understanding.&lt;/p&gt;

&lt;p&gt;The browser extension of ImageTrans supports freely combining various OCR and translation technologies to meet the translation needs of various manga images. The entire process can run on your local computer, free to use, no subscription required, no pay‑per‑page fees, with faster response times and better controllability. This is the method I recommend for translating images in web pages.&lt;/p&gt;

&lt;p&gt;Below is a demo video:&lt;/p&gt;

&lt;p&gt;The video demonstrates how to use ImageTrans’s browser extension to translate manga images from Pixiv, from Japanese to English. It shows purely browser‑based local processing methods – region‑based screenshot translation, full‑image translation – as well as using the ImageTrans desktop edition for higher‑quality translation.&lt;/p&gt;

&lt;iframe width=&quot;560&quot; height=&quot;315&quot; src=&quot;https://www.youtube.com/embed/kRN4o2UWkSQ?si=voLKSRTQJlJUByMT&quot; title=&quot;YouTube video player&quot; frameborder=&quot;0&quot; allow=&quot;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&quot; referrerpolicy=&quot;strict-origin-when-cross-origin&quot; allowfullscreen=&quot;&quot;&gt;&lt;/iframe&gt;
</description>
        <pubDate>Sat, 16 May 2026 12:22:50 +0000</pubDate>
        <link>https://www.basiccat.org/manga-image-translation-browser-extension/</link>
        <guid isPermaLink="true">https://www.basiccat.org/manga-image-translation-browser-extension/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>Remove Text in Manga and Preserve Screentone</title>
        <description>&lt;p&gt;In the lettering step of manga translation, the original text needs to be erased and replaced with the translated text.&lt;/p&gt;

&lt;p&gt;One of its tedious tasks is restoring screentones. Most &lt;a href=&quot;https://post.smzdm.com/p/ar085mx7/&quot;&gt;tutorials&lt;/a&gt; use Photoshop’s clone stamp tool, which requires careful alignment with the screentone and struggles with complex patterns like gradients. Photoshop’s built-in content-aware fill is also unpredictable and may fill in irrelevant image content.&lt;/p&gt;

&lt;p&gt;Another method involves using custom patterns and overlaying them on a separate layer, as explained in the &lt;a href=&quot;https://docs.krita.org/en/reference_manual/layers_and_masks/fill_layer_generators/screentone.html#term-Macrocell&quot;&gt;Krita documentation&lt;/a&gt;, but the supported patterns are limited. More professional software like CLIP STUDIO PAINT and ComicWorks also require significant manual effort.&lt;/p&gt;

&lt;p&gt;With advances in image processing and AI technology, several software tools now offer effective screentone restoration.&lt;/p&gt;

&lt;p&gt;The computer-assisted manga translation software ImageTrans provides an all-in-one manga translation platform, covering processes from scanning, text recognition, translation to typesetting.&lt;/p&gt;

&lt;p&gt;It integrates several methods for text masking and text removal. Below is a comparison of the results produced by some of its methods that handle screentones well.&lt;/p&gt;

&lt;p&gt;Original image:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/capture001.jpg&quot; alt=&quot;Original&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Text mask:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/capture001.jpg-mask.png&quot; alt=&quot;Mask&quot; /&gt;&lt;/p&gt;

&lt;p&gt;PatchMatch (traditional image processing):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/patchmatch.jpg&quot; alt=&quot;PatchMatch&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Flux Klein 9B：&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/flux.jpg&quot; alt=&quot;Flux&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Lama Inpaint (original):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/original.jpg&quot; alt=&quot;Original Lama&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Lama Inpaint (fine-tuned for Japanese manga):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/fine-tuned.jpg&quot; alt=&quot;Fine-tuned Lama&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Gemini 3.0 (Nano Banana):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/screentones/nano-banana-2.jpg&quot; alt=&quot;Gemini&quot; /&gt;&lt;/p&gt;

&lt;p&gt;As you can see, the results are quite good, especially the fine-tuned Lama’s, and even when there are minor flaws, only a small amount of correction is needed.&lt;/p&gt;

&lt;p&gt;These generative AI models have powerful repainting capabilities. Besides restoring screentones, they can also reconstruct various backgrounds covered by text, such as human figures, landscapes, and other content.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/imagetrans-features/text-removal-and-reinjection.jpg&quot; alt=&quot;Example 2&quot; /&gt;&lt;/p&gt;
</description>
        <pubDate>Mon, 11 May 2026 11:21:50 +0000</pubDate>
        <link>https://www.basiccat.org/remove-text-in-manga-preserving-screentone/</link>
        <guid isPermaLink="true">https://www.basiccat.org/remove-text-in-manga-preserving-screentone/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Translate Images on a Web Page</title>
        <description>&lt;p&gt;Browsers all have built-in webpage translation features, but they usually can only translate selectable text, not images. Yandex supports image translation, but with unsatisfactory results. Safari allows copying text but has no translation function. The computer-assisted image translation software &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; provides a Chrome browser extension that supports direct translation of images on web pages, erasing the original text, filling in the translated text, and restoring colors, rotations, and various styles.&lt;/p&gt;

&lt;p&gt;Below is its processing workflow:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Through the browser extension, an image translation request is sent to the server.&lt;/li&gt;
  &lt;li&gt;The server calls the connected ImageTrans program to perform the translation.&lt;/li&gt;
  &lt;li&gt;ImageTrans carries out text recognition with OCR, text translation, text inpainting and typesetting, generating a translated image.&lt;/li&gt;
  &lt;li&gt;After receiving the translation result, the browser extension replaces the original image with the translated version.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Because this operates within the browser, some technical limitations must be overcome:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Browsers restrict the use of local programs. The local server needs to add headers that allow local invocation. Older versions of Chrome also require HTTPS.&lt;/li&gt;
  &lt;li&gt;If an image is cross‑origin restricted, it cannot be downloaded directly, and fetching image data via canvas will result in a tainted canvas error. In this case, the extension must modify headers to remove CORS restrictions. If referrer information is also restricted, or if the image is not an &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;&amp;lt;img&amp;gt;&lt;/code&gt; element but a canvas, making direct download impossible, a screenshot feature can be used—but content outside the viewport will be lost. Pixiv, for example, restricts cross‑origin access.&lt;/li&gt;
  &lt;li&gt;Some websites overlay additional content on images, so replacing the image alone does not display the translated version. For instance, X.com.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The ImageTrans extension can already translate images on common websites. Tested sites include Read Comics Online, Pixiv, MangaDex, Lezhin, Tencent Comics, and more. It supports various image types: manga, webtoon, manhua, comics, illustration, scanned books, etc.&lt;/p&gt;

&lt;p&gt;Many browser extensions on the market rely on remote servers and require a subscription. ImageTrans, by contrast, runs entirely on the local computer, allowing unlimited translation, using various OCR engines and large language models, directly downloading images to the local machine, preserving translation results, and freely defining text styles.&lt;/p&gt;

&lt;p&gt;Extension link: &lt;a href=&quot;https://github.com/xulihang/ImageTrans_chrome_extension&quot;&gt;https://github.com/xulihang/ImageTrans_chrome_extension&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Demo video:&lt;/p&gt;

&lt;iframe src=&quot;//player.bilibili.com/player.html?isOutside=true&amp;amp;aid=116533535705831&amp;amp;bvid=BV1TKdPBcE5G&amp;amp;cid=38164957756&amp;amp;p=1&quot; scrolling=&quot;no&quot; border=&quot;0&quot; frameborder=&quot;no&quot; framespacing=&quot;0&quot; allowfullscreen=&quot;true&quot;&gt;&lt;/iframe&gt;

&lt;p&gt;PS:&lt;/p&gt;

&lt;p&gt;ImageTrans’s extension also supports OCR right in the browser through WebAssembly. But its pocessing speed is slow and the ability is limited. It is provided as a free option along with the public ImageTrans server. It is recommended to use the extension with ImageTrans desktop version.&lt;/p&gt;

</description>
        <pubDate>Fri, 08 May 2026 12:42:50 +0000</pubDate>
        <link>https://www.basiccat.org/translate-images-on-webpages-with-extension/</link>
        <guid isPermaLink="true">https://www.basiccat.org/translate-images-on-webpages-with-extension/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>Real-Time Screen Translator</title>
        <description>&lt;p&gt;Recently, I want to play the raw Japanese GBA game &lt;em&gt;Ace Attorney&lt;/em&gt;. I like the feeling of the original Japanese version and want to improve my Japanese skill. My Japanese level is limited so I still need a screen translator to help me.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; is such a program. It integrates with various OCR and translation engines and can work as a real-time screen translator to play games like visual novel and read manga.&lt;/p&gt;

&lt;p&gt;Here is its workflow:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;The program detects whether the screen capture of the region specified by the user has changed and is now stable.&lt;/li&gt;
  &lt;li&gt;Use OCR to recognize the text in the region.&lt;/li&gt;
  &lt;li&gt;Use AI translation engines like ChatGPT and Deepseek to translate the text.&lt;/li&gt;
  &lt;li&gt;(Optional) Add &lt;a href=&quot;/how-to-add-pronunciation-annotation-furigana-to-Japanese-kanji/&quot;&gt;pronuciation annotation&lt;/a&gt; (Furigana for Japanese or Pinyin for Chinese) and use text-to-speech engine to read the text.&lt;/li&gt;
  &lt;li&gt;(Optional) Save the recognition results for &lt;a href=&quot;/how-to-learn-japanese-by-reading-manga/&quot;&gt;learning&lt;/a&gt; afterwards.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;The whole process is automatic. You can also make the screen capture by yourself and ask the program to translate, since you may also need to read the text outside the region you specified.&lt;/p&gt;

&lt;p&gt;The following is a demo video. Although it takes a while for the OCR and translation to complete and there are mistakes, it is playable.&lt;/p&gt;

&lt;iframe width=&quot;560&quot; height=&quot;315&quot; src=&quot;https://www.youtube.com/embed/B9xmNjmcb10?si=rod6GaSBPGQ0KBAX&quot; title=&quot;YouTube video player&quot; frameborder=&quot;0&quot; allow=&quot;accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share&quot; referrerpolicy=&quot;strict-origin-when-cross-origin&quot; allowfullscreen=&quot;&quot;&gt;&lt;/iframe&gt;

&lt;p&gt;ImageTrans can also translate other medias, like books and manga. It is an integrated translation tool, like Photoshop for translation.&lt;/p&gt;

&lt;p&gt;ImageTrans can run on desktop platforms like Windows, macOS and Linux. Mobile operating systems like Android and iOS are currently not supported.&lt;/p&gt;

&lt;h2 id=&quot;optimizing-translation-speed-and-quality&quot;&gt;Optimizing Translation Speed and Quality&lt;/h2&gt;

&lt;p&gt;Real-time translation processes text only within the current input box, lacking contextual information, which leads to lower translation quality. Additionally, performing a translation operation each time results in slower speeds.&lt;/p&gt;

&lt;p&gt;For many games, the text can be extracted, or transcriptions are already available. In such cases, we can pre-translate the text using a large language model. Later, during on-screen translation, the system can match the source text with OCR results and directly retrieve the pre-translated text, achieving both high speed and high quality. This feature is called &lt;strong&gt;translation memory&lt;/strong&gt; in ImageTrans.&lt;/p&gt;

&lt;p&gt;Below is a simple comparison using DeepSeek for translation. It shows that more context leads to higher translation quality.&lt;/p&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Original Text&lt;/th&gt;
      &lt;th&gt;Full-Text Translation&lt;/th&gt;
      &lt;th&gt;Sentence-by-Sentence Translation&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;（うう‥‥。 キンチョーするなあ‥‥）&lt;/td&gt;
      &lt;td&gt;(Ugh… I’m so nervous…)&lt;/td&gt;
      &lt;td&gt;(Ugh… Don’t be so nervous…)&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;‥‥なるほどくん！&lt;/td&gt;
      &lt;td&gt;…Naruhodo-kun!&lt;/td&gt;
      &lt;td&gt;…Mr. I See!&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;ふう。 なんとか、間にあったわね。&lt;/td&gt;
      &lt;td&gt;Phew. I managed to make it in time.&lt;/td&gt;
      &lt;td&gt;Phew. We made it just in time.&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;どうかしら？　初めての法廷は。&lt;/td&gt;
      &lt;td&gt;How do you feel? Your first time in court.&lt;/td&gt;
      &lt;td&gt;How about it? Your first time in court?&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;こ、こんなにドキドキするの、 小学校の学級裁判のとき以来です。&lt;/td&gt;
      &lt;td&gt;Th-This is the first time my heart has raced like this since the class trial in elementary school.&lt;/td&gt;
      &lt;td&gt;My heart’s pounding like this for the first time since the class trial back in elementary school.&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;‥‥それはそれは。 ずいぶん、ごぶさたしてるのね。&lt;/td&gt;
      &lt;td&gt;…Well, well. It’s been a long time, hasn’t it?&lt;/td&gt;
      &lt;td&gt;…My, my. It’s been quite a while, hasn’t it?&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Additionally, ImageTrans supports fuzzy matching between the source text and OCR results. Even if OCR makes errors, it can still find matches, preventing translation mistakes caused by faulty OCR.&lt;/p&gt;

&lt;p&gt;For example, the sentence &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;借りがあるんですよ。 ヤツには。&lt;/code&gt; might be misrecognized as &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;借りがあるんですよ。 ャツには。&lt;/code&gt;, where the character “ヤ” becomes the small kana “ャ”, potentially leading to translation errors. For instance, DeepSeek might mistakenly translate it as “I owe you a favor” instead of “I owe him a favor.” Matching against pre-translated text extracted from the original avoids such issues.&lt;/p&gt;
</description>
        <pubDate>Sat, 11 Apr 2026 12:39:50 +0000</pubDate>
        <link>https://www.basiccat.org/real-time-screen-translator/</link>
        <guid isPermaLink="true">https://www.basiccat.org/real-time-screen-translator/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>Manga Typesetting Tool Comparison: TypeR and ImageTrans</title>
        <description>&lt;p&gt;In the world of manga translation, typesetting is where everything comes together. After OCR and translation, you still need to place text back into the artwork in a way that feels natural, readable, and faithful to the original design. This is where tools like &lt;strong&gt;TypeR&lt;/strong&gt; and &lt;strong&gt;ImageTrans&lt;/strong&gt; come in.&lt;/p&gt;

&lt;p&gt;Both target similar users—scanlators, translators, letterers, and comic localizers—but they represent two very different philosophies: one centered on Photoshop, the other on an integrated pipeline.&lt;/p&gt;

&lt;h2 id=&quot;overview&quot;&gt;Overview&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;TypeR&lt;/strong&gt; is a Photoshop extension (forked from TyperTools) designed specifically for manga and comic typesetting. It enhances Photoshop with features like smart alignment, style management, and multi-bubble handling, aiming to speed up manual work without leaving the design environment.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/typer/typer.jpg&quot; alt=&quot;typer&quot; /&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;ImageTrans&lt;/strong&gt; is a standalone computer-aided translation tool that covers OCR, translation, and text reinsertion. It can automatically detect text regions, remove original text, and inject translations, while still supporting export back to Photoshop for refinement.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/typer/imagetrans.png&quot; alt=&quot;imagetrans&quot; /&gt;&lt;/p&gt;

&lt;h2 id=&quot;workflow-comparison&quot;&gt;Workflow Comparison&lt;/h2&gt;

&lt;h3 id=&quot;typer-photoshop-centric-workflow&quot;&gt;TypeR: Photoshop-Centric Workflow&lt;/h3&gt;

&lt;p&gt;TypeR assumes Photoshop is your main workspace:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Open image in Photoshop&lt;/li&gt;
  &lt;li&gt;Insert text using TypeR tools&lt;/li&gt;
  &lt;li&gt;Adjust layout manually&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Its biggest strength is eliminating repetitive actions (copy-paste, alignment, style switching) inside Photoshop.&lt;/p&gt;

&lt;p&gt;However:&lt;/p&gt;

&lt;ul&gt;
  &lt;li&gt;OCR and translation must be done elsewhere&lt;/li&gt;
  &lt;li&gt;Workflow is fragmented across tools&lt;/li&gt;
&lt;/ul&gt;

&lt;h3 id=&quot;imagetrans-end-to-end-workflow&quot;&gt;ImageTrans: End-to-End Workflow&lt;/h3&gt;

&lt;p&gt;ImageTrans covers the full pipeline:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Load images&lt;/li&gt;
  &lt;li&gt;Detect and extract text (OCR)&lt;/li&gt;
  &lt;li&gt;Translate using multiple engines&lt;/li&gt;
  &lt;li&gt;Remove original text and reinsert translation&lt;/li&gt;
  &lt;li&gt;Export to PSD for final refinement&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;It integrates OCR, large language model translation, text cleaning, typesetting and batch processing into one system.&lt;/p&gt;

&lt;p&gt;This reduces tool switching and enables large-scale workflows.&lt;/p&gt;

&lt;h2 id=&quot;feature-comparison-table&quot;&gt;Feature Comparison Table&lt;/h2&gt;

&lt;table&gt;
  &lt;thead&gt;
    &lt;tr&gt;
      &lt;th&gt;Feature&lt;/th&gt;
      &lt;th&gt;TypeR&lt;/th&gt;
      &lt;th&gt;ImageTrans&lt;/th&gt;
    &lt;/tr&gt;
  &lt;/thead&gt;
  &lt;tbody&gt;
    &lt;tr&gt;
      &lt;td&gt;Core Type&lt;/td&gt;
      &lt;td&gt;Photoshop plugin&lt;/td&gt;
      &lt;td&gt;Standalone application&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;OCR (text extraction)&lt;/td&gt;
      &lt;td&gt;❌ Not supported&lt;/td&gt;
      &lt;td&gt;✅ Built-in multi-engine OCR&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Translation integration&lt;/td&gt;
      &lt;td&gt;❌ External only&lt;/td&gt;
      &lt;td&gt;✅ Multiple MT engines + CAT features&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Text detection&lt;/td&gt;
      &lt;td&gt;❌ Manual&lt;/td&gt;
      &lt;td&gt;✅ Automatic detection (AI + heuristics)&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Text removal (cleaning)&lt;/td&gt;
      &lt;td&gt;❌ Manual in Photoshop&lt;/td&gt;
      &lt;td&gt;✅ Automated (masking, lama inpainting, nano banana, etc.)&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Text alignment&lt;/td&gt;
      &lt;td&gt;✅ Yes&lt;/td&gt;
      &lt;td&gt;✅ Yes&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Text reinsertion&lt;/td&gt;
      &lt;td&gt;✅ Manual typesetting&lt;/td&gt;
      &lt;td&gt;✅ Automatic + adjustable reinjection&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Multi-bubble handling&lt;/td&gt;
      &lt;td&gt;✅ Yes&lt;/td&gt;
      &lt;td&gt;✅ Yes&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Style management&lt;/td&gt;
      &lt;td&gt;✅ Advanced (folders, presets)&lt;/td&gt;
      &lt;td&gt;✅ Global styles&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Batch processing&lt;/td&gt;
      &lt;td&gt;❌ Not designed for it&lt;/td&gt;
      &lt;td&gt;✅ Supported&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Photoshop integration&lt;/td&gt;
      &lt;td&gt;✅ Native plugin&lt;/td&gt;
      &lt;td&gt;✅ Export to PSD&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Automation level&lt;/td&gt;
      &lt;td&gt;Low (manual control)&lt;/td&gt;
      &lt;td&gt;High (semi/fully automated)&lt;/td&gt;
    &lt;/tr&gt;
    &lt;tr&gt;
      &lt;td&gt;Cross-platform&lt;/td&gt;
      &lt;td&gt;Depends on Photoshop&lt;/td&gt;
      &lt;td&gt;✅ Windows / macOS / Linux&lt;/td&gt;
    &lt;/tr&gt;
  &lt;/tbody&gt;
&lt;/table&gt;

&lt;h2 id=&quot;which-one-to-use&quot;&gt;Which One to Use&lt;/h2&gt;

&lt;p&gt;Choose TypeR is you prefer to do the typesetting entirely in Photoshop.&lt;/p&gt;

&lt;p&gt;ImageTrans is a better choice in almost all aspects:&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;An integrated environment for document scanning, image extracting from CBZ/PDF, image editing, OCR, translation, cleaning and typesetting.&lt;/li&gt;
  &lt;li&gt;Compatible with basiccaly all Photoshop versions (CS2+) and any other image editors like Krita and GIMP.&lt;/li&gt;
&lt;/ol&gt;

</description>
        <pubDate>Tue, 07 Apr 2026 13:06:50 +0000</pubDate>
        <link>https://www.basiccat.org/manga-typesetting-tool-comparison-typer-imagetrans/</link>
        <guid isPermaLink="true">https://www.basiccat.org/manga-typesetting-tool-comparison-typer-imagetrans/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Convert Vertical Japanese PDFs to Horizontal Layout</title>
        <description>&lt;p&gt;Japanese is typically written in vertical, which may be difficult for people accustomed to horizontal text. We can convert it to a horizontal layout.&lt;/p&gt;

&lt;p&gt;PDFs can be divided into two types: those with selectable text and those that cannot be copied. For the selectable type, since the PDF simply renders text at fixed positions, the text often cannot be copied in reading order and requires layout analysis. For non-selectable versions, such as scanned PDFs, OCR software is needed to recognize the text, followed by layout analysis to determine the correct text order.&lt;/p&gt;

&lt;p&gt;The computer-aided image translation software &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; supports processing both types of PDFs, exporting vertically oriented text into a horizontal format.&lt;/p&gt;

&lt;p&gt;Example:&lt;/p&gt;

&lt;p&gt;Original PDF File:&lt;/p&gt;

&lt;iframe src=&quot;/assets/japanese.pdf&quot; width=&quot;240px&quot; height=&quot;320px&quot;&gt;&lt;/iframe&gt;

&lt;p&gt;Converted horizontal text:&lt;/p&gt;

&lt;p&gt;もちろんです以下にランダムな日本語の文章を生成しました。&lt;/p&gt;

&lt;p&gt;一匹の子犬。王都が見えてきた。
でかいのは態度だけで後は何もかもが小さい。&lt;/p&gt;

&lt;p&gt;It also supports adding furigana annotations:&lt;/p&gt;

&lt;p&gt;もちろんです&lt;ruby&gt;以下&lt;rt&gt;いか&lt;/rt&gt;&lt;/ruby&gt;にランダムな&lt;ruby&gt;日本語&lt;rt&gt;にほんご&lt;/rt&gt;&lt;/ruby&gt;の&lt;ruby&gt;文章&lt;rt&gt;ぶんしょう&lt;/rt&gt;&lt;/ruby&gt;を&lt;ruby&gt;生成&lt;rt&gt;せいせい&lt;/rt&gt;&lt;/ruby&gt;しました。&lt;/p&gt;
&lt;p&gt;&lt;ruby&gt;一匹&lt;rt&gt;いっぴき&lt;/rt&gt;&lt;/ruby&gt;の&lt;ruby&gt;子犬&lt;rt&gt;こいぬ&lt;/rt&gt;&lt;/ruby&gt;。&lt;ruby&gt;王都&lt;rt&gt;おうと&lt;/rt&gt;&lt;/ruby&gt;が&lt;ruby&gt;見&lt;rt&gt;み&lt;/rt&gt;&lt;/ruby&gt;えてきた。&lt;/p&gt;
&lt;p&gt;でかいのは&lt;ruby&gt;態度&lt;rt&gt;たいど&lt;/rt&gt;&lt;/ruby&gt;だけで&lt;ruby&gt;後&lt;rt&gt;あと&lt;/rt&gt;&lt;/ruby&gt;は&lt;ruby&gt;何&lt;rt&gt;なに&lt;/rt&gt;&lt;/ruby&gt;もかもが&lt;ruby&gt;小&lt;rt&gt;ちい&lt;/rt&gt;&lt;/ruby&gt;さい。&lt;/p&gt;
</description>
        <pubDate>Wed, 25 Mar 2026 06:25:50 +0000</pubDate>
        <link>https://www.basiccat.org/how-to-convert-vertical-Japanese-PDF-to-horizontal/</link>
        <guid isPermaLink="true">https://www.basiccat.org/how-to-convert-vertical-Japanese-PDF-to-horizontal/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Add Pronunciation (Furigana) to Japanese Kanji</title>
        <description>&lt;p&gt;Japanese kanji originated from China and are logographic characters. Their pronunciation can be annotated using romaji or kana. The kana used for this purpose are called furigana.&lt;/p&gt;

&lt;p&gt;Below are some examples of text with pronunciation annotations (raw text and annotated):&lt;/p&gt;

&lt;p&gt;Example 1:&lt;/p&gt;

&lt;p&gt;一匹の子犬。&lt;/p&gt;

&lt;p&gt;&lt;ruby&gt;一匹&lt;rt&gt;いっぴき&lt;/rt&gt;&lt;/ruby&gt;の&lt;ruby&gt;子犬&lt;rt&gt;こいぬ&lt;/rt&gt;&lt;/ruby&gt;。&lt;/p&gt;

&lt;p&gt;Example 2:&lt;/p&gt;

&lt;p&gt;もちろんです以下にランダムな日本語の文章を生成しました。&lt;/p&gt;

&lt;p&gt;もちろんです&lt;ruby&gt;以下&lt;rt&gt;いか&lt;/rt&gt;&lt;/ruby&gt;にランダムな&lt;ruby&gt;日本語&lt;rt&gt;にほんご&lt;/rt&gt;&lt;/ruby&gt;の&lt;ruby&gt;文章&lt;rt&gt;ぶんしょう&lt;/rt&gt;&lt;/ruby&gt;を&lt;ruby&gt;生成&lt;rt&gt;せいせい&lt;/rt&gt;&lt;/ruby&gt;しました。&lt;/p&gt;

&lt;p&gt;Example 3:&lt;/p&gt;

&lt;p&gt;王都が見えてきた。&lt;/p&gt;

&lt;p&gt;&lt;ruby&gt;王都&lt;rt&gt;おうと&lt;/rt&gt;&lt;/ruby&gt;が&lt;ruby&gt;見&lt;rt&gt;み&lt;/rt&gt;&lt;/ruby&gt;えてきた。&lt;/p&gt;

&lt;p&gt;Example 4:&lt;/p&gt;

&lt;p&gt;でかいのは態度だけで後は何もかもが小さい。&lt;/p&gt;

&lt;p&gt;でかいのは&lt;ruby&gt;態度&lt;rt&gt;たいど&lt;/rt&gt;&lt;/ruby&gt;だけで&lt;ruby&gt;後&lt;rt&gt;あと&lt;/rt&gt;&lt;/ruby&gt;は&lt;ruby&gt;何&lt;rt&gt;なに&lt;/rt&gt;&lt;/ruby&gt;もかもが&lt;ruby&gt;小&lt;rt&gt;ちい&lt;/rt&gt;&lt;/ruby&gt;さい。&lt;/p&gt;

&lt;p&gt;Adding pronunciation annotations to Japanese kanji is not a simple task. It requires correctly segmenting Japanese sentences into words and then obtaining the accurate pronunciation of the segmented results. Japanese kanji pronunciation is complex, with on’yomi (Chinese readings) and kun’yomi (Japanese readings), and phonetic changes often occur when multiple characters are combined.&lt;/p&gt;

&lt;p&gt;Kuromoji is a specialized Japanese morphological analysis software that performs word segmentation, grammatical analysis, and pronunciation retrieval based on dictionaries such as ipadic and unidic. It can be used to annotate kanji with pronunciation.&lt;/p&gt;

&lt;p&gt;However, it has some drawbacks.&lt;/p&gt;

&lt;p&gt;The default ipadic dictionary has limited data, leading to inaccurate annotations. Unidic, on the other hand, segments too finely—for example, splitting “日本語” (Japanese) into “日本” and “語,” which results in annotation errors. Additionally, it handles counters like “一匹” (one small animal) poorly.&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; integrates Kuromoji and combines the results from ipadic and unidic to generate accurate annotations. It also automatically handles counters, achieving better annotation results. This can be used to assist with manga reading, Japanese language learning, and automatic text annotation. If higher accuracy is required, large language models such as GPT or DeepSeek can also be used for annotation.&lt;/p&gt;
</description>
        <pubDate>Wed, 25 Mar 2026 06:25:50 +0000</pubDate>
        <link>https://www.basiccat.org/how-to-add-pronunciation-annotation-furigana-to-Japanese-kanji/</link>
        <guid isPermaLink="true">https://www.basiccat.org/how-to-add-pronunciation-annotation-furigana-to-Japanese-kanji/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Detect Text Colors in Images</title>
        <description>&lt;p&gt;Detecting text colors in images, including background colors and stroke colors, can be used for layout restoration, translation, and other operations.&lt;/p&gt;

&lt;p&gt;Original image:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/source.jpg&quot; alt=&quot;Original image&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Translated image:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/translated.jpg&quot; alt=&quot;Translated image&quot; /&gt;&lt;/p&gt;

&lt;h2 id=&quot;implementation-principle&quot;&gt;Implementation Principle&lt;/h2&gt;

&lt;p&gt;For background colors, KMeans can be used to cluster colors and identify the dominant colors in the image.&lt;/p&gt;

&lt;p&gt;For text colors, contour detection can be performed to calculate the average pixel value of the text contour and exclude pixels that are similar to the background color.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/plain-color.jpg&quot; alt=&quot;Plain color background&quot; /&gt;&lt;/p&gt;

&lt;p&gt;However, if the background is complex and stroke colors or the color of each character need to be extracted, traditional image processing methods may not perform well. In such cases, convolutional neural networks can be used for extraction.&lt;/p&gt;

&lt;p&gt;The computer-assisted image translation software &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; integrates the algorithms mentioned above.&lt;/p&gt;

&lt;h2 id=&quot;examples&quot;&gt;Examples&lt;/h2&gt;

&lt;p&gt;Below are some operational examples.&lt;/p&gt;

&lt;h3 id=&quot;example-1&quot;&gt;Example 1&lt;/h3&gt;

&lt;p&gt;In many CG images, different colors are often used to distinguish dialogues between different characters.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/example.jpg&quot; alt=&quot;Example&quot; /&gt;&lt;/p&gt;

&lt;p&gt;ImageTrans can detect stroke colors and text colors. Below are the raw recognition results.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/detected.jpg&quot; alt=&quot;Detection results&quot; /&gt;&lt;/p&gt;

&lt;p&gt;As can be seen, the text colors and stroke colors have been detected, but they are not very accurate.&lt;/p&gt;

&lt;p&gt;In the project settings, we can predefine several styles and specify the colors used for each style.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/project-settings.jpg&quot; alt=&quot;Project settings&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Afterward, perform a color matching operation to match the styles based on the text colors.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/workflow.jpg&quot; alt=&quot;Workflow&quot; /&gt;&lt;/p&gt;

&lt;p&gt;This resolves the issue of inaccuracies in the detected colors.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/adjusted.jpg&quot; alt=&quot;Adjusted results&quot; /&gt;&lt;/p&gt;

&lt;h3 id=&quot;example-2&quot;&gt;Example 2&lt;/h3&gt;

&lt;p&gt;Some texts use rich text formatting, where a single line of text may contain multiple colors. ImageTrans supports recognizing the style of each character and outputting results with rich text tags.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/text-color-detection/inline-text.jpg&quot; alt=&quot;Inline styles&quot; /&gt;&lt;/p&gt;
</description>
        <pubDate>Sun, 15 Mar 2026 08:28:50 +0000</pubDate>
        <link>https://www.basiccat.org/how-to-detect-text-color-in-image/</link>
        <guid isPermaLink="true">https://www.basiccat.org/how-to-detect-text-color-in-image/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Convert Traditional Chinese PDF from Vertical to Horizontal and Enhance Text Clarity</title>
        <description>&lt;p&gt;I recently came across several older Taiwanese academic papers and found that they were all in PDF format with vertical text, Traditional Chinese, and black-and-white scanned copies. The text was illegible, which was so difficult to read that I worried it might worsen my myopia. So, I came up with some ways to optimize the reading experience. For users in Mainland China, converting the vertical text to horizontal format makes it much easier to read. However, sometimes preserving the original layout is better. So I also tried using OCR to recognize the text, remove the original content, and reformat it with clear, readable text. Below is a demonstration of the results and how to do it.&lt;/p&gt;

&lt;h2 id=&quot;demonstration-of-results&quot;&gt;Demonstration of Results&lt;/h2&gt;

&lt;p&gt;Original file (&lt;a href=&quot;https://kmweb.moa.gov.tw/redirect_files.php?id=98635&quot;&gt;PDF&lt;/a&gt;):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/vertical-text-PDF/original.png&quot; alt=&quot;Original file&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Converted to horizontal markdown format:&lt;/p&gt;

&lt;div class=&quot;language-markdown highlighter-rouge&quot;&gt;&lt;div class=&quot;highlight&quot;&gt;&lt;pre class=&quot;highlight&quot;&gt;&lt;code&gt;卷五十第

期一十二第

&lt;span class=&quot;gu&quot;&gt;## 那一個甘蔗品種好?&lt;/span&gt;

&lt;span class=&quot;gu&quot;&gt;## 選擇蔗種的基本原則&lt;/span&gt;

選擇甘蔗品種的基本原則,是「適地適種」,而「適地」的意義,並不是單指適合的土壤的意思,還包括蔗園所在地點的一切環境條件,諸如氣候狀況、灌溉條件、肥料多少、勞力充份否,間作不間作,留宿根不留等等,都要考慮到.只有適合種蔗環境的品種,才能有好的收成,所以在選擇品種之前,首先要明白所經營的蔗園的環境.下面幾點是主要的考慮對象：

（1）土質和地力：土質的粘重或輕鬆,主要關係土壤水分的保持與喪失.粘重的土壤,保水力较强,纵然在灌溉水不够充分的地方,也能栽培比較不耐旱的品種,但是粘重的蔗園容易排水不良,根系太深,不耐浸的品种就发育不良.土质轻松的土壤適恰相反,不適合栽培不耐旱品種,而適應根系深的品種.

甘蔗品種的耐肥性是個個不同的,有的適合在好地方的蔗園,有的在地方好的蔗園反而減產.所以地方的肥沃瘦瘠也需要密切注意.

（2）灌溉的條件：要說甘蔗是水做的,一點也沒有錯,一支甘蔗有七〇％以上是水.因之,水成了種蔗的必具條件,只是甘蔗品種裏有些特別喜愛水一些,缺了水,產量隨即降低,另一些對水淡漠一些,早一點影響產量不大.選擇品種的時候,不能不顧到這點.

（3）宿根：每一個品種的宿根能力多有不同,有些可以宿根,有些宿根产量很低,留宿根化不來,要是選錯了就要自己吃虧.

（4）植期不同：有些品种只能作为秋植,而且必須早秋植；延遲種植,產量降低.另一些品種又只適合春植,作為秋植時並不理想.而二期糊仔有二期糊仔的品種,一期糊仔有一期糊仔的品種,

都混雜不得的.

（5）勞力的多少：有的蔗友家中勞力很多,或者財力雄厚,足以招雇够量的勞力來從事精耕栽培,那麼就應該選擇適合精耕栽培的品種.否則,應該選擇在粗放耕作時仍有相當產量的品種.

（6）病蟲害問題：現在推廣的品種,大致說對於主要的病蟲害都是抵抗的,但在容易發生病蟲害的環境如玉米栽培地區,有些品種比較容易發生露菌病,應該避免栽培.

&lt;span class=&quot;gu&quot;&gt;## 目前推廣的優良蔗種&lt;/span&gt;

談完了選擇品種時考慮的主要種蔗環境條件,現在來選擇品種,首先要介紹一些最近由糖業試驗所育成,在推廣中的甘蔗新品種.

F一四六已經推廣三個年期了,大家對它都已十分認識,它是一個中大莖,多分櫱的品種.初期生育緩慢,而在次年高溫多水的季節迅F一四六速生長.原料莖很多,只是短一點,適合在地力中等以上,有水灌溉,不很粘重的地方作为早秋植,F一四六可以間作,宿根也不錯,最高的產量可以達到三十萬公斤.

F一四八作為春植或晚秋植時,它增產能力比在早秋植時高,同時也不適合二期糊仔.蔗莖又細小一點,所以蔗友們不很歡迎.事實F一四八上,用F一四八作晚植蔗園的品種,它的產量比F一四六,和F一五二安定.而且在所有的新品種中,F一四八是糖份最高的一個,也是最不怕露菌病的品種,又因為它很早熟,種了它,多半是在製糖季初收穫,蔗友們可以提前利用種蔗後的土地.

F一四八也不適合在粘重或過份輕鬆,以及乾旱的地方栽培,間作和宿根的條件不及F一四六.

&lt;span class=&quot;gu&quot;&gt;## F一五一&lt;/span&gt;

F一五一適合在粘土蔗園秋植及二期糊仔

,在溫度較低,土壤較乾的時候,萌

芽比其他新品種好.它是中莖種,原料莖比F一四六細而長,也比F一四六容易倒伏,但不易开花；和F一四六同样地晚熟,步留也差不多.露菌病的感染程度較重.F一五一的新植產量在輕鬆的壤土裏不及F一四六,在粘重的土壤則勝過F一四六.宿根的產量和F一四六相仿.

F一五一不能作為晚植蔗園的品種.

F一五二是地力肥沃,灌溉水充沛蕉園的增產王牌,它的原料莖高而粗,只要保持適當的原料莖數,很容易的得上二十萬公斤以上的F一五一甲當產量.不過在灌溉水缺乏,地力不夠而又沒有辦法施用重肥的蔗園,F一五二沒有增產的希望.

七、八月種植的F一五二容易母莖徒長,因此不如九月種植的蔗園產量穩定.F一五二是個好&lt;span class=&quot;sb&quot;&gt;


&lt;/span&gt;&lt;span class=&quot;gu&quot;&gt;## ·紹介者作期本·&lt;/span&gt;

▼湯冠雄：曾任臺灣糖業試驗所種藝系技師,現任該所虎尾蔗作改良場場長,從事甘蔗栽培研究已十七年.

&lt;span class=&quot;p&quot;&gt;![&lt;/span&gt;&lt;span class=&quot;nv&quot;&gt;图片&lt;/span&gt;&lt;span class=&quot;p&quot;&gt;](&lt;/span&gt;&lt;span class=&quot;sx&quot;&gt;)&lt;/span&gt;

夏輔禹：安東省岡城縣人,臺灣省立農學院畢業,現任糖業試驗所虎尾蔗作改良場研究組組長,一直擔任蔗田間作試驗方面工作.

&lt;span class=&quot;p&quot;&gt;![&lt;/span&gt;&lt;span class=&quot;nv&quot;&gt;图片&lt;/span&gt;&lt;span class=&quot;p&quot;&gt;](&lt;/span&gt;&lt;span class=&quot;sx&quot;&gt;)&lt;/span&gt;

&lt;/code&gt;&lt;/pre&gt;&lt;/div&gt;&lt;/div&gt;

&lt;p&gt;A clear version preserving the original layout (Image + PDF):&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/vertical-text-PDF/reconstructed.png&quot; alt=&quot;Clear version&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Text-based PDF with vector text, remains sharp even when zoomed in:&lt;/p&gt;

&lt;p&gt;&lt;a href=&quot;/album/vertical-text-PDF/reconstructed-pdf.pdf&quot;&gt;File Link&lt;/a&gt;&lt;/p&gt;

&lt;h2 id=&quot;how-to-guide&quot;&gt;How-to Guide&lt;/h2&gt;

&lt;p&gt;This requires using the OCR software &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt;.&lt;/p&gt;

&lt;ol&gt;
  &lt;li&gt;Use the built-in bubble detection feature to detect vertical text lines.&lt;/li&gt;
  &lt;li&gt;Use a large language model for OCR. Since the text here is quite blurry, only large language models can achieve good results. I chose the &lt;code class=&quot;language-plaintext highlighter-rouge&quot;&gt;qwen3-vl-235b-a22b-instruct&lt;/code&gt; model for this task.&lt;/li&gt;
  &lt;li&gt;Use PaddleOCR’s DocLayout panel detection model to identify text paragraphs and sort the text lines from right to left according to the reading order.&lt;/li&gt;
  &lt;li&gt;Specify text styles for both vertical and horizontal text. Then, determine which text is horizontal and which is vertical based on the aspect ratio.&lt;/li&gt;
  &lt;li&gt;Export as PDF.&lt;/li&gt;
  &lt;li&gt;Next, export as markdown. Here, we can merge text lines by paragraph to remove unnecessary line breaks.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Software main interface:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/vertical-text-PDF/imagetrans.jpg&quot; alt=&quot;Software main interface&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Displaying panels and order:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/vertical-text-PDF/imagetrans-order-mode.jpg&quot; alt=&quot;Displaying panels and order&quot; /&gt;&lt;/p&gt;
</description>
        <pubDate>Sun, 08 Mar 2026 11:32:50 +0000</pubDate>
        <link>https://www.basiccat.org/how-to-convert-Traditional-Chinese-PDF-from-vertical-to-horizontal-and-enhance-text-clarity/</link>
        <guid isPermaLink="true">https://www.basiccat.org/how-to-convert-Traditional-Chinese-PDF-from-vertical-to-horizontal-and-enhance-text-clarity/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
      <item>
        <title>How to Scan Negatives</title>
        <description>&lt;p&gt;Photos taken with old film cameras usually leave behind negatives (film) after being developed and printed at a photo lab. I recently came across quite a few of these negatives. The negatives contain a type of silver salt that is highly light-sensitive. Standard negatives are 35mm wide and 24mm high, and the developed content is inverted.&lt;/p&gt;

&lt;p&gt;Because negatives are very small and have an additional orange mask (color cast), digitizing them is not an easy task. A common method is to use a professional film scanner. If a scanner isn’t available, you can also use a mobile phone or a camera to “scan” them by photographing.&lt;/p&gt;

&lt;h2 id=&quot;photographing-with-a-camera&quot;&gt;Photographing with a Camera&lt;/h2&gt;

&lt;ol&gt;
  &lt;li&gt;
    &lt;p&gt;Negatives are transparent and require transmitted light to reveal the image. You can prepare a white light source, such as a phone screen or a ceiling light.&lt;/p&gt;

    &lt;p&gt;&lt;img src=&quot;/album/negative/negative_on_screen.jpg&quot; alt=&quot;Negative on a phone screen&quot; /&gt;&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;Use a macro lens to take the photo. For a mobile phone, you can attach an external macro lens.&lt;/p&gt;

    &lt;p&gt;&lt;img src=&quot;/album/negative/macro_lens.jpg&quot; alt=&quot;Macro lens attached to a phone&quot; /&gt;&lt;/p&gt;
  &lt;/li&gt;
  &lt;li&gt;
    &lt;p&gt;After shooting, use software like Adobe Photoshop for further processing: inverting the colors and using the “Auto Color” function to remove the orange mask.&lt;/p&gt;

    &lt;p&gt;&lt;img src=&quot;/album/negative/invert.jpg&quot; alt=&quot;Inverted image&quot; /&gt;&lt;/p&gt;

    &lt;p&gt;&lt;img src=&quot;/album/negative/auto_color_result.jpg&quot; alt=&quot;Result after auto color adjustment&quot; /&gt;&lt;/p&gt;
  &lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Since a mobile phone was used here, there might be issues like insufficient sharpness, inaccurate colors, and screen moiré patterns. Using a professional camera would yield much better results, potentially offering greater sharpness than some scanners. Additionally, professional cameras can save files in raw format, which is more convenient for color correction.&lt;/p&gt;

&lt;h2 id=&quot;scanning-with-a-scanner&quot;&gt;Scanning with a Scanner&lt;/h2&gt;

&lt;p&gt;Choose a flatbed scanner that supports scanning negatives, such as the Epson V850. The one I used here is an Epson V300, which comes with a transparency unit and film holders, making it suitable for scanning negatives.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/negative/epson-v300.jpg&quot; alt=&quot;Epson V300 scanner&quot; /&gt;&lt;/p&gt;

&lt;p&gt;You can use the manufacturer’s software, like Epson Scan, to scan. It can automatically crop negatives, invert colors, and remove the orange mask.&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/negative/epson-scan.jpg&quot; alt=&quot;Epson Scan software interface&quot; /&gt;&lt;/p&gt;

&lt;p&gt;If you prefer to obtain only the raw scan data, you can use software like &lt;a href=&quot;/imagetrans/&quot;&gt;ImageTrans&lt;/a&gt; to control the scanner via interfaces such as TWAIN, ICA, or SANE.&lt;/p&gt;

&lt;p&gt;Afterwards, you can use ImageTrans or other software for cropping, inverting, and color correction.&lt;/p&gt;

&lt;p&gt;Scanning using ImageTrans:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/negative/imagetrans_transparency_unit.jpg&quot; alt=&quot;ImageTrans scanning interface showing transparency unit option&quot; /&gt;&lt;/p&gt;

&lt;p&gt;Result after cropping, inverting, and color correction using ImageTrans:&lt;/p&gt;

&lt;p&gt;&lt;img src=&quot;/album/negative/imagetrans_processed_photo.jpg&quot; alt=&quot;ImageTrans processed photo result&quot; /&gt;&lt;/p&gt;

</description>
        <pubDate>Fri, 27 Feb 2026 12:28:50 +0000</pubDate>
        <link>https://www.basiccat.org/how-to-scan-negatives/</link>
        <guid isPermaLink="true">https://www.basiccat.org/how-to-scan-negatives/</guid>
        
        <category>imagetrans</category>
        
        
        <category>blog</category>
        
      </item>
    
  </channel>
</rss>
