Open source search engine

Tags: popularity rank indexer html hypertext relevance ispell domain names hypertext references web cgi insensitive search boolean query xml text virtual url url schemes sql databases thai languages url scheme search phrases audio mpeg website group search module


Datapark Corp.

DataparkSearch Engine is a full-featured open sources web-based search engine released under the GNU General Public License and designed to organize search within a website, group of websites, intranet or local system. DataparkSearch consists of two parts. The first part is indexing mechanism (indexer). Indexer walks over html hypertext references and stores found words and new references into database. The second part is web CGI front-end to provide search using data collected by indexer. Key features: Support for http, https, ftp, nntp and news URL schemes; htdb virtual URL scheme support for indexing SQL databases; text/html, text/xml, text/plain,audio/mpeg (MP3) and image/gif mime types built-in support; External parsers support for other document types; Ability to index multilingual sites using content negotiation; Searching all of the word forms using ispell affixes and dictionaries; Fuzzy searching based on acronyms and abbreviations. Stopwords and synonyms lists; Boolean query language support; Results sorting by relevance, popularity rank, last modified time and by importance (a multiplication of relevance and popularity rank); Various character sets support; Accent insensitive search; Phrases segmenting for Chinese, Japanese, Korean and Thai languages; mod_dpsearch - search module for Apache web server; Internationalized Domain Names support; The Summary Extraction Algorithm.

Software Price: Freeware
Software Version: 4.44
Release Date
: 1/22/2007

Size: 2.11 MB
Platform: Unix, Linux

Download Link: DataparkSearch Download

Keywords: free sql dpsearch software dataparksearch indexer search engine apache


  Search by keyword:      

Copyright © 2003-2012