Computers and technology — programming languages, software, hardware, internet services, security, artificial intelligence, and more. Explore thousands of tech resources organized by a knowledgeable community of editors.
56203 resources
Effort to implement a prototype of an open source web-search engine.
Open source, cross-platform distributed crawler. FAQ, documentation and a support forum.
A GPLv3-licensed search engine written in PHP, designed for open web or intranet crawls.
A cross-platform search engine written in C++ that provides text search and a rich structured query language. BSD-like license.
An open source, high-precision corporate search engine based on Apache Nutch.
An open source, LGPL 2.1-licensed full-text search engine and column store written in C. Works with MySQL and Postgres. Site provides online documentation and downloads.
A search engine designed for indexing database content. It natively supports MySQL, PostgreSQL, and XML pipe interfaces. It is written in C++ and has a GPL license.
Java-based, Apache-licensed enterprise web crawler that runs on any platform and integrates with virtually any search engine (open source or commercial).
A collection of C++ (C++98) libraries and command line tools for building a competitive full-text search engine. Development status is pre-alpha.
A packaged, Apache v2-licensed, enterprise search solution that leverages ManifoldCF for data sources, Solr for the search engine, and Cassandra for user management.
A C++, GPL-licensed search engine developed at the University of Waterloo. Wumpus allows control of the text unit retrieved based on structural constraints in the query.
Open source search engine tool released under GPL and designed to organize search within a website, group of websites, intranet or local system.
An open source search engine based on Common Crawl.
An open source SDK for building distributed web crawlers based on Apache Storm.
A PHP and MySQL/MariaDB based system that automatically crawls the content of one or more websites.
A tool for finding code by looking at the applications' GUI text messages (e.g., "Undo") and returning associated callbacks/slots (e.g., slotUndo()). Allows searching the KDE project CVS repository as a live demonstration.
A web robot, search engine and web server written in Java and available under GPL. Includes related resources. [Project no longer actively updated]
An open source web spider and search engine. Includes demo, source code and screenshots.
A lightweight search engine in PHP. Includes details of features, documentation, support forum, and download. [GPL]
Open source search engine library written in C++, with bindings to allow use from other languages as well.
Specifically designed for knowledge area or corporate search, written in C++.
A .NET web crawler written in C# using SQL Server 2005 and Lucene. Documentation and online demonstration.
A distributed Web crawler and caching HTTP/HTTPS proxy built on the principles of peer-to-peer (P2P) networks.
A GPLv3 search engine and crawler for URLs, databases, and file systems. Comes with an XML/HTTP API and PHP/ASP clients. Based on Apache Tomcat, JavaServer Faces and JBoss RichFaces.