How Do I Choose the Best Open Source Search Engine?
Open source search refers to search engines with open source code, which is different from the commercial search engines we usually use, such as Google, Yahoo, etc. The search engine core technologies of these search engine companies are not open to the public.
Open source search
Right!
- Chinese name
- Open source search
- Foreign name
- Open source search
- Nature
- Search engine
- Founded
- Year 2000
- Founder
- JimmyWales
- Earliest region
- United States
- Open source search refers to search engines with open source code. This is different from the commercial search engines we usually use, such as Google and Yahoo. The search engine core technologies of these search engine companies are not open to the public.
- In today's fast-developing information age of the Internet, whoever captures the essence of the Internet is the customer. With customers having traffic and user stickiness, the popularity of search engines will increase. Search engines have an increasing influence on people and even in some aspects and countries. Searches by search engine companies like Google are free to users, but the core technology of his search engine is not open to the outside world, which has led to search engines having a monopoly on obtaining information on the Internet.
- The emergence of open source search engines has brought new hope to search engines.
- Speaking of open source search engines have to say
- Introduction to some open source search engine systems, including open source web search engines and open source desktop search engines.
Sphider Open source search Sphider
- Sphider is a lightweight, web spider and search engine developed using PHP, and uses mysql to store data. You can use it to add search capabilities to your website. Sphider is very small, easy to install and modify, and thousands of websites already use it.
RiSearch PHP RiSearch PHP
- RiSearch PHP is an efficient and powerful search engine, especially suitable for small and medium websites. RiSearch PHP is very fast, it can search 5000-10000 pages in less than 1 second. RiSearch is an indexed search engine, which means that it first indexes your website and builds a database to store keywords for all pages of your website for quick searches. Risearch is a full-text search engine script, which compiles all keywords into a document index except for keywords excluded by definition in the configuration file. RiSearch uses the classic reverse indexing algorithm (same as large search engines), which is why it is faster than other search engines.
PhpDig Open source search PhpDig
- PhpDig is a web crawler and search engine developed using PHP. Build a glossary by indexing dynamic and static pages. When searching for a query, it will display a search results page containing keywords in a certain collation. PhpDig includes a template system and is able to index PDF, Word, Excel, and PowerPoint documents. PHPdig is suitable for more specialized and deeper personalized search engines. Using it to build a vertical search engine targeted at a certain field is the best choice.
OpenWebSpider Open source search OpenWebSpider
- OpenWebSpider is an open source multi-threaded Web Spider (robot: crawler: crawler) and a search engine with many interesting features.
Egothor Open Source Search Egothor
- Egothor is an open source and efficient full text search engine written in Java. With Java's cross-platform features, Egothor can be applied to any environment, both as a separate search engine and for your application for full-text search.
Nutch Open source search Nutch
- Nutch is an open source Java-implemented search engine. It provides all the tools we need to run our own search engine. Includes full-text search and web crawlers.
Apache Lucene Open source search Apache Lucene
- Lucene is a Java-based full-text search engine. With it, you can easily add full-text search capabilities to Java software. Lucene's main job is to index each word of the document. Indexing makes search efficiency much more efficient than traditional verbatim. Lucen provides a set of APIs for interpreting, filtering, analyzing documents, organizing and using indexing. In addition to efficiency and simplicity, the most important thing is that users can customize their functions at any time according to their needs.
Oxyus Open Source Search Oxyus
- Oxyus is a web search engine written in pure java.
BDDBot Open source search BDDBot
- BDDBot is a simple, easy to understand and use search engine. It now crawls through URLs listed in a text file (urls.txt), saving the results in a database. It also supports a simple web server that accepts queries from the browser and returns response results. It can be easily integrated into your Web site.
Zilverline Open source search for Zilverline
- Zilverline is a search engine that searches the local hard disk or intranet content via the web. Zilverline can grab their content from PDF, Word, Excel, Powerpoint, RTF, txt, java, CHM, zip, rar and other documents to build abstracts and indexes. Results found from the local hard disk or intranet can be retrieved again. Zilverline supports multiple languages including Chinese.
XQEngine Open source search XQEngine
- XQEngine is a full text search engine for XML documents. Use XQuery as its front-end query language. It enables you to query a collection of XML documents by using a logical combination of keywords. A bit similar to how Google searches HTML documents with other search engines. XQEngine is just a very compact and embeddable component developed in Java.
MG4J Open source search MG4J
- MG4J lets you build a compressed full-text index for a large collection of documents, by using interpolative coding techniques.
JXTA Search JXTA Search
- JXTA Search is a distributed search system. Designed for use on peer-to-peer networks and websites.
YaCy Open source search YaCy
- YaCy is a p2p-based distributed web search engine. It is also an HTTP cache proxy server. This project is a new way to build a p2p web indexing network. It can search your own or global index, you can also crawl your own webpage or launch distributed crawling, etc.
Red-Piranha Open Source Search Red-Piranha
- Red-Piranha is an open source search system that truly "learns" what you are looking for. Red-Piranha can be used as a personal search engine for your desktop system (Windows, Linux and Mac), or an intranet search engine, or to provide search functions for your website, or as a P2P search engine, or combined with wiki as a knowledge / Document management solutions, or search for RSS feeds you want, or search your company's systems (including SAP, Oracle, or any other Database / Data source), or for managing PDF, Word, and other documents, or provide as a WebService for searching information or providing a search background for your application (Web, Swing, SWT, Flash, Mozilla-XUL, PHP, Perl or c # /. Net) and much more.
LIUS Open source search LIUS
- LIUS is an indexing framework based on the Jakarta Lucene project. LIUS adds indexing capabilities to many file formats for Lucene such as: Ms Word, Ms Excel, Ms PowerPoint, RTF, PDF, XML, HTML, TXT, Open Office sequences, and JavaBeans. Indexing for JavaBeans is particularly useful when we want to The database is indexed or the user happens to use the persistence layer ORM technology such as: Hibernate, JDO, Torque, TopLink for development.
Apache Solr Open source search Apache Solr
- Apache Solr is a high-performance, full-text search server based on Lucene, developed using Java 5. Documents are added to a search collection using XML via Http. Querying the collection is also achieved by receiving an XML / JSON response from http. Its main features include: efficient and flexible caching function, vertical search function, highlighting search results, improving usability through index replication, providing a powerful set of Data Schema to define fields, types and setting text analysis, providing web-based Management interface, etc.
Paoding Open Source Search Paoding
- Paoding Chinese word segmentation is a Chinese search engine word segmentation component developed for Java and integrated into Lucene applications for the Internet and intranet. Paoding fills in the gaps of open source components in domestic Chinese word segmentation, is committed to this and hopes to become the first Chinese word segmentation open source component for Internet sites. Paoding Chinese word segmentation seeks high efficiency of word segmentation and good user experience.
Carrot2 Open source search Carrot2
- Carrot2 is an open source search result classification engine. It can automatically organize the search results into some thematic categories. Carrot2 provides an architecture that can obtain search results from various search engines (YahooAPI, GoogleAPI, MSN Search API, eTools Meta Search, Alexa Web Search, PubMed, OpenSearch, Lucene index, SOLR).
Regain Open source search
- Regain is a desktop search engine system similar to Web search engines. The difference is that regain is not a search for Internet content, but a search for your own documents or files. Using regain can easily complete a large number of seconds Search of data (many G's). Regain uses Lucene's search syntax, so it supports multiple query methods, supports multi-index search and advanced search based on file type, and can implement URL rewriting and file-to-HTTP bridging. stand by.
- Regain offers two versions: desktop search and server search. Desktop Search provides a quick search for documents on ordinary desktop computers and web pages in a LAN environment. The server version is mainly installed on the web server to search for web sites and file servers in a LAN environment.