path: root/textproc/amberfish
Commit message | Author | Age | Files | Lines
* - Switch SourceForge ports to the new File Release System: categories starting with T,U,V | Dmitry Marakasov | 2009-08-22 | 1 | -1/+2
  Notes: svn path=/head/; revision=240076
* - reset MAINTAINER per request | Yen-Ming Lee | 2009-05-06 | 1 | -1/+1
  PR: 134253
  Submitted by: giffunip@tutopia.com
  Notes: svn path=/head/; revision=233296
* Amberfish is general purpose text retrieval software, developed at Etymon | Martin Wilke | 2008-09-30 | 6 | -0/+527
  by Nassib Nassar and distributed as open source software under the terms of version 2 of the GNU General Public License (GPL).

  Its distinguishing features are indexing/search of semi-structured text (i.e. both free text and multiply nested fields), built-in support for XML documents using the Xerces library, structured queries allowing generalized field/tag paths, hierarchical result sets (XML only), automatic searching across multiple databases (allowing modular indexing), TREC format results, efficient indexing, and relatively low memory requirements during indexing (and the ability to index documents larger than available memory). Z39.50 support is available.

  Other features include Boolean queries, right truncation, phrase searching, relevance ranking, support for multiple documents per file, incremental indexing, and easy integration with other UNIX tools. The architecture is also designed to permit proximity queries; however, they are not fully implemented at present.

  WWW: http://www.etymon.com/tr.html

  This port also includes the Porter stemming algorithm for suffix stripping, available at: http://www.tartarus.org/~martin/PorterStemmer

  PR: ports/127580
  Submitted by: Pedro Giffuni
  Notes: svn path=/head/; revision=221052