search
HomeBackend DevelopmentPHP TutorialUse PHP to call the Lucene package to implement full-text search_PHP tutorial

Use PHP to call the Lucene package to implement full-text search_PHP tutorial

Jul 13, 2016 pm 05:41 PM
lucenephpuseFull Text SearchInsideBagaccomplishrightWorkquantityusewebsitetransferneed

Due to work needs, it is necessary to use PHP to implement full-text retrieval of a large number of websites.
And the most popular search engine library for full-text retrieval is Lucene.
It is a sub-project of Apache Jakarta. It also provides simple and practical APIs.
Using these APIs, you can perform full-text search on any basic text data (including databases).


Because PHP itself supports calling external Java classes, I first wrote a class in Java.
This class implements two methods by calling the Lucene API:

public String createIndex(String indexDir_path,String dataDir_path)
public String searchword(String ss,String index_path)
where createIndex is the index creation method,
passes in two parameters namely indexDir_path(index file directory), dataDir_path (indexed file directory), returns the indexed file list string,
The other is searchword, which retrieves the index through the passed keyword parameter (ss), index_path is the index file directory. Returns all retrieved files.

Here is the source code, it is very simple, you can refer to it: TxtFileIndexer.java

The PHP program calls these two methods to realize the call to Lucene, thereby achieving the purpose of full-text retrieval.
The calling method of PHP is as follows:
First create an instance of the TxtFileIndexer class we wrote,

$tf = new Java(TestLucene.TxtFileIndexer);

Then call the normal PHP class calling method, first create the index:

$data_path = "F:/test/php_lucene/htdocs/data/manual"; //The directory that defines the indexed content
$index_path = "F:/test/php_lucene/htdocs/data/search"; //Define the generated index file storage directory
$s = $tf->createIndex($index_path,$data_path); //Call the Java class method
print $s; //Print the returned result

Try searching again this time:

$index_path = "F:/test/php_lucene/htdocs/data/search"; //Define the generated index file storage directory
$s = $tf->searchword("here is keyword for search" ,$index_path);
print $s;

Also pay attention to the path of the Java class, which can be set in PHP

java_require("F:/test/php_lucene/htdocs/lib/"); //This is an example, my classes and Lucene are both placed in this directory

That’s it, isn’t it?

PHP source code: test.php


Next, let me talk about the environment configuration.
First of all, you need to have Java SDK, which is a must. I am using version 1.4.2, other versions should be fine.
PHP5, I tried PHP4, it should work.

Since the Java extension of PHP5 has not been adjusted, and calling Java in the past has been very inefficient and slow, I used the Php/Java Bridge project.

1. Download JavaBridge
URL: http://sourceforge.net/projects/php-java-bridge/
The current version is
php-java-bridge_3.0.8_j2ee.zip

After unpacking, copy
JavaBridgeWEB-INFcgijava-x86-windows.dll
JavaBridgeWEB-INFlibJavaBridge.jar
to the c:phpext directory, and copy
java-x86-windows.dll Renamed to php_java.dll


2. Modify php.ini (example)
extension=php_java.dll

[Java]
java.class.path = "C:phpextJavaBridge.jar;F:testphp_lucenehtdocs"
java.java_home = "C:j2sdk1.4.2_10"
java.library.path = "c:phpext;F:testphp_lucenehtdocs"

3. Restart Apache.

4. You can find some files for indexing
You can modify the paths of index files and data files in test.php.
Line 37 of TxtFileIndexer.java limits indexing to only files with html suffix, which can be modified if necessary.

According to the current situation (JavaBridge supports Linux and Freebsd), it can be run under
linux or freebsd/apache2/php4/lucene/JavaBridge
environment.

www.bkjia.comtruehttp: //www.bkjia.com/PHPjc/486089.htmlTechArticleDue to work needs, it is necessary to use PHP to perform full-text retrieval on a large number of websites, and the most popular full-text retrieval is currently The best search engine library is Lucene, which is Apache Jakart...
Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Dependency Injection in PHP: Avoiding Common PitfallsDependency Injection in PHP: Avoiding Common PitfallsMay 16, 2025 am 12:17 AM

DependencyInjection(DI)inPHPenhancescodeflexibilityandtestabilitybydecouplingdependencycreationfromusage.ToimplementDIeffectively:1)UseDIcontainersjudiciouslytoavoidover-engineering.2)Avoidconstructoroverloadbylimitingdependenciestothreeorfour.3)Adhe

How to Speed Up Your PHP Website: Performance TuningHow to Speed Up Your PHP Website: Performance TuningMay 16, 2025 am 12:12 AM

ToimproveyourPHPwebsite'sperformance,usethesestrategies:1)ImplementopcodecachingwithOPcachetospeedupscriptinterpretation.2)Optimizedatabasequeriesbyselectingonlynecessaryfields.3)UsecachingsystemslikeRedisorMemcachedtoreducedatabaseload.4)Applyasynch

Sending Mass Emails with PHP: Is it Possible?Sending Mass Emails with PHP: Is it Possible?May 16, 2025 am 12:10 AM

Yes,itispossibletosendmassemailswithPHP.1)UselibrarieslikePHPMailerorSwiftMailerforefficientemailsending.2)Implementdelaysbetweenemailstoavoidspamflags.3)Personalizeemailsusingdynamiccontenttoimproveengagement.4)UsequeuesystemslikeRabbitMQorRedisforb

What is the purpose of Dependency Injection in PHP?What is the purpose of Dependency Injection in PHP?May 16, 2025 am 12:10 AM

DependencyInjection(DI)inPHPisadesignpatternthatachievesInversionofControl(IoC)byallowingdependenciestobeinjectedintoclasses,enhancingmodularity,testability,andflexibility.DIdecouplesclassesfromspecificimplementations,makingcodemoremanageableandadapt

How to send an email using PHP?How to send an email using PHP?May 16, 2025 am 12:03 AM

The best ways to send emails using PHP include: 1. Use PHP's mail() function to basic sending; 2. Use PHPMailer library to send more complex HTML mail; 3. Use transactional mail services such as SendGrid to improve reliability and analysis capabilities. With these methods, you can ensure that emails not only reach the inbox, but also attract recipients.

How to calculate the total number of elements in a PHP multidimensional array?How to calculate the total number of elements in a PHP multidimensional array?May 15, 2025 pm 09:00 PM

Calculating the total number of elements in a PHP multidimensional array can be done using recursive or iterative methods. 1. The recursive method counts by traversing the array and recursively processing nested arrays. 2. The iterative method uses the stack to simulate recursion to avoid depth problems. 3. The array_walk_recursive function can also be implemented, but it requires manual counting.

What are the characteristics of do-while loops in PHP?What are the characteristics of do-while loops in PHP?May 15, 2025 pm 08:57 PM

In PHP, the characteristic of a do-while loop is to ensure that the loop body is executed at least once, and then decide whether to continue the loop based on the conditions. 1) It executes the loop body before conditional checking, suitable for scenarios where operations need to be performed at least once, such as user input verification and menu systems. 2) However, the syntax of the do-while loop can cause confusion among newbies and may add unnecessary performance overhead.

How to hash strings in PHP?How to hash strings in PHP?May 15, 2025 pm 08:54 PM

Efficient hashing strings in PHP can use the following methods: 1. Use the md5 function for fast hashing, but is not suitable for password storage. 2. Use the sha256 function to improve security. 3. Use the password_hash function to process passwords to provide the highest security and convenience.

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Nordhold: Fusion System, Explained
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Clair Obscur: Expedition 33 - How To Get Perfect Chroma Catalysts
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

ZendStudio 13.5.1 Mac

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft