dtSearch Products
dtSearch provides simple to use but very powerful tools which create and
maintain full text indexes of documents and data. Anything that contains
text can be indexed, even databases. It handles very large collections of
documents, and terabytes of text can be searched across a desktop, network,
internet or intranet site. dtSearch offers a desktop version, a web version,
and a publishing version for distribution of document collections on CD or DVD.
dtSearch products can meet the needs of organizations looking for an affordable
solution to finding information stored in diverse formats in multiple
locations.
Suggested applications include:
-
As your intranet or internet search engine.
- For spidering selected websites and building your own custom index
or subject portal.
- For indexing collections of documents or files.
- For indexing email (including attachments).
- For federated searching across multiple dtSearch indexes and other databases.
Key benefits:
Fast and flexible searching
Searching, even of many terabytes of text, is extremely fast, because it draws
upon an index that stores the location of words in files, without altering the
original files in any way. There are many powerful search options available:
| fuzzy |
find search terms even if they are misspelled (0-10 scale) |
| phonic |
finds words that sound alike, like Smythe in a search for Smith |
| synonym |
finds quick, speedy, etc. in a search for fast |
| stemming |
applies, applied, applying in a search for apply |
These are just a few examples - the list includes proximity matching, wildcards,
built-in and user-created thesauri, boolean, exact phrase match, all words
match, any words match, field and document attribute filters, and more. We
especially like the fuzzy matching capability, as it finds search terms even
with typos and OCR errors - always a reality when searching documents and full
text from a wide range of sources!
Hit highlighting
Search results are displayed with the search terms or "hits" highlighted and
with the original formatting, links and images intact. The initial results list
can show :
-
Document or web page title
-
Synopsis, i.e. a brief snippet of text showing the first hits in context.
-
Hit count
-
Date and size
To avoid delays, only the page with the first hit highlighted will be downloaded
initially. The remainder will be downloaded as the user browses through the
document.
Ease of use
Setup is fast and easy - an index and default web search interface can be set
up within just a few minutes, i.e.
-
Specify what to index i.e. select the folders or website or Outlook folder
-
Include or exclude specific file types
-
For updates to the index use Windows Task Scheduler and
-
Choose to index only new or modified documents
-
Remove deleted or missing items from the index
-
Specify web interface parameters
-
Choose the indexes to search
-
Choose to enable a log file to track searches for later analysis
-
Choose options to display, ie. fuzzy searching, synonyms etc.
-
Specify items to include in search results list
-
Choose document display formatting options
-
After the search form is generated you can edit the HTML to customize the
appearance or cut and paste the code into other pages.
Customization
dtSearch offers excellent possibilities for integrating the search interface
into a clients template and for building custom applications using a .NET
developers programming interface. Andornot can customize the out-of-the-box
layout to your requirements or can assist you in building more complex
applications. Check out the
Search
functionality on our website to see how dtSearch can be
integrated with a website interface.
Supported File and Data Types
Document types and data which can be indexed by dtSearch include:
-
Office formats (word processor, spreadsheet, presentation documents)
-
PDF (supports "image with hidden text" format)
-
Email (Outlook and Eudora message stores, including attachments)
-
Web pages (HTML, XML, ASP, ASP.NET, etc.)
-
Databases (Any database that supports data access, e.g. through ODBC)
-
Unicode support for indexing and searching of non-English text
-
Automatic detection of fields in document summaries, XML, HTML, PDF etc.
Contact us for more information.
|
Andornot is an authorized dtSearch Corp. developer.
See dtSearch in
action as the search engine for our website and more!
Andornot Combines dtSearch and Inmagic for Document Discovery
For a recent project, we built a web application for a prominent
New York law firm for use in large cases involving hundreds of gigabytes of
document data. The documents, all PDF's averaging thousands of pages in size,
were first scanned with OCR technology, and then indexed and made searchable
with dtSearch. Fast searches across the collection allowed significant
information to be quickly identified. Relevant pages were extracted into new
PDF documents, and key value-added information was entered into an Inmagic
database, all from a single browser-based interface.
|