


Why do the pdf files opened by the pdf viewer have garbled characters?
I use CAJViewer
CAJViewer5.5_OCR v5.5.0 Build 4030
Description: With OCR recognition, with multi-language package, OCR recognition supports Chinese and English recognition. Size: 32.911 MB
1) Partial text recognition: directly use the ocr of caj browser
Save the print file in MDI format, and then open the file using Microsoft Office Document Image. Select "Use OCR to recognize text" under the Tools menu to identify text content. After completing the recognition, select "Send Text to Word" under the Tools menu to output the recognition results of the entire PDF file to a Word file.
Please note: Microsoft Office Document Image can recognize and convert Chinese, English and table content very accurately. However, it cannot directly output graphics to a Word document. Instead, it forms all graphics in the file into independent picture files and places them in the same folder with the same folder name as the original file. Therefore, you can use Snagit software to open these graphic files and copy and paste them into Word. (It should be noted that all recognition software cannot handle the problem of pattern recognition well, and the processing method of Microsoft Office Document Image is already one of the best solutions to solve this problem.)
Recommended quick method:
Before extracting text from CAJ files, the following preparations are required: First, make sure that CAJ file browser 5.5 and Office2003 are installed, and the Office tool Microsoft Office Document Imaging is fully installed. Once the installation is complete, you will see the Microsoft Office Document Image Writer printer in the printer list. With Microsoft Office Document Image, you can recognize and convert Chinese, English, table and other document contents with high accuracy. These preparations can ensure that you can successfully extract the text information in the CAJ file.
Identification of CAJ files:
(1) First, download the CAJ format data file from the Internet and save it to the local hard disk.
(2) Then, start the CAJViewer browser program and open the CAJ format file just saved in the program. After browsing the file to the last page, do not close the CAJ browser program.
(3) In the CAJ browser program window, select "File" → "Print", and select the printer as the Microsoft Office Document Image Writer printer, check the print to file option and determine the number of pages to print.
(4) Save the print file (*.prn) to the appropriate location. After waiting for printing to complete, Microsoft Office Document Image automatically opens the print file you just saved.
(5) In the Microsoft Office Document Image window, select the "Select All Pages" menu item in the "Page" menu, and then select "Use OCR to recognize text" in the "Tools" menu to extract text.
(6) Select "Send text to word" under "Tools", and finally the entire CAJ file recognition will be output to the word file.
How to fix garbled characters when opening a word document using wps
Sometimes when you open a Word document, you may see that the document has become a bunch of garbled characters. Don’t worry, you can try the following two methods to save your files.
1. Replacement format method .heike123.com
Is to save the damaged Word document in another format.
1. Open the damaged document and click the "File/Save As" menu. In the "Save Type" list, select "RTF Format", then click the "Save" button and close Word.
2. Open the RTF format file you just saved, and use "Save As" again to save the file as a "Word Document". Now open the word file and you will find that the file has been restored.
If the file still cannot be recovered after converting it to rtf format, you can convert the file to plain text format (*.txt) again, and then convert it back to Word format. Of course, the pictures and other information will be lost when converting to txt file.
How to solve the problem of garbled characters when converting PDF to word document
Some PDF files will be garbled when converted into word documents. I have used a lot of conversion software, but the result is that the text is still garbled. In order to solve this problem, I used the following stupid method:
1. Double-click to open the PDF file. Of course, you must download and install the PDF converter in advance
2. Convert Chinese text in PDF to editable word document. The method is: (in the opened PDF file) click: File-Save As, and after "Save as type", select: "TXT file (*.txt )", select "Desktop" after "Save in", click "Save", open the txt document on the desktop (with the same name as the PDF), select the text, copy and paste it into the word document.
3. Copy the pictures in the PDF to the word document. The method is: (in the open PDF file) click: Tools-Snapshot (if the picture is larger, please click the "Reduce" tool in the second line to until you can see the whole picture), select the picture (press and hold the left button of the mouse in the upper left corner of the picture, drag to the lower right corner, then a dotted box should appear, release the mouse), in the open word document Paste in place (Ctrl V).
4. At this time, you can edit the text in the word document to what you want. Of course, the pictures in it can only be formatted and cannot be edited.
The above 2 can also be done like this: (in the open PDF file), click: Tools-Text Viewer (the text in the PDF is already in text form), then right-click "Select All"-"Copy", Just "paste" it into Word. Although this method is page by page, it can be similar to the original layout in the word document. Then click: Tools-Text Viewer (you can also click Alt 9 repeatedly) to enter the PDF reader interface (or text interface).
Steps to use the online PDF to Word converter:
Step one: Upload the PDF file that needs to be converted. It will show that the file you uploaded is successful. Click to generate a word document;
Step 2: Wait for server processing;
Step 3: Download the word document and save it on your computer.
The above is the detailed content of Why do pdf files opened using pdf viewer display garbled characters?. For more information, please follow other related articles on the PHP Chinese website!

This article addresses the Windows "INVALID_DATA_ACCESS_TRAP" (0x00000004) error, a critical BSOD. It explores common causes like faulty drivers, hardware malfunctions (RAM, hard drive), software conflicts, overclocking, and malware. Trou

This article provides practical tips for maintaining ENE SYS systems. It addresses common issues like overheating and data corruption, offering preventative measures such as regular cleaning, backups, and software updates. A tailored maintenance s

Article discusses editing Windows Registry, precautions, backup methods, and potential issues from incorrect edits. Main issue: risks of system instability and data loss from improper changes.

Article discusses managing Windows services for system health, including starting, stopping, restarting services, and best practices for stability.

What does the drive health warning in Windows Settings mean and what should you do when you receive the disk warning? Read this php.cn tutorial to get step-by-step instructions to cope with this situation.

This article identifies five common pitfalls in ENE SYS implementation: insufficient planning, inadequate user training, improper data migration, neglecting security, and insufficient testing. These errors can lead to project delays, system failures

This article identifies ene.sys as a Realtek High Definition Audio driver component. It details its function in managing audio hardware, emphasizing its crucial role in audio functionality. The article also guides users on verifying its legitimacy

This article addresses the failure of the Windows asio.sys audio driver. Common causes include corrupted system files, hardware/driver incompatibility, software conflicts, registry issues, and malware. Troubleshooting involves SFC scans, driver upda


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool

Dreamweaver Mac version
Visual web development tools

Notepad++7.3.1
Easy-to-use and free code editor

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

SublimeText3 Mac version
God-level code editing software (SublimeText3)
