dtSearch Products
dtSearch provides simple-to-use but very powerful tools that
create and maintain full-text indexes of documents and data.
Anything that contains text can be indexed, even databases. It
handles very large collections of documents, and terabytes of text
can be searched across a desktop, network, internet or intranet
site. dtSearch offers a desktop version, a web version, and a
publishing version for distribution of document collections on CD
or DVD.
dtSearch products can meet the needs of organizations looking
for an affordable solution to finding information stored in diverse
formats in multiple locations.
Suggested applications include:
- As your intranet or internet search engine.
- For spidering selected websites and building your own custom
index or subject portal.
- For indexing collections of documents or files.
- For indexing email (including attachments).
- For federated searching across multiple dtSearch indexes and
other databases.
Key benefits:
Fast and flexible searching
Searching, even of many terabytes of text, is extremely fast,
because it draws upon an index that stores the location of words in
files, without altering the original files in any way. There are
many powerful search options available:
| fuzzy | finds search terms even if they are misspelled (0-10 scale) |
| phonic | finds words that sound alike, like Smythe in a search for Smith |
| synonym | finds quick, speedy, etc. in a search for fast |
| stemming | finds applies, applied, applying in a search for apply |
These are just a few examples - the list includes proximity
matching, wildcards, built-in and user-created thesauri, boolean,
exact phrase match, all words match, any words match, field and
document attribute filters, and more. We especially like the fuzzy
matching capability, as it finds search terms even with typos and
OCR errors - always a reality when searching documents and full
text from a wide range of sources!
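To make the two ideas above concrete (an index that records where words occur, and fuzzy matching that tolerates typos), here is a minimal Python sketch. It is an illustration only and not dtSearch's implementation or API: the function names `build_index`, `edit_distance` and `fuzzy_search` are hypothetical, and the `max_edits` threshold is an assumed stand-in for the 0-10 fuzziness scale, whose exact meaning is not described here.

```python
from collections import defaultdict
import re


def build_index(docs):
    """Map each lowercased word to {doc_name: [word positions]}.

    The original texts are only read, never modified.
    """
    index = defaultdict(lambda: defaultdict(list))
    for name, text in docs.items():
        for pos, word in enumerate(re.findall(r"\w+", text.lower())):
            index[word][name].append(pos)
    return index


def edit_distance(a, b):
    """Levenshtein distance between two strings, using a rolling row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def fuzzy_search(index, term, max_edits=1):
    """Return {doc_name: [positions]} for indexed words within max_edits of term."""
    term = term.lower()
    hits = defaultdict(list)
    for word, postings in index.items():
        if edit_distance(word, term) <= max_edits:
            for name, positions in postings.items():
                hits[name].extend(positions)
    return {name: sorted(positions) for name, positions in hits.items()}


# Hypothetical sample documents; the second mimics OCR errors.
docs = {
    "a.txt": "Fast full text search of documents",
    "b.txt": "OCR output: ful text serch of scanned documnts",
}
index = build_index(docs)
result = fuzzy_search(index, "search", max_edits=1)
```

With `max_edits=0` the lookup behaves like an exact word match; raising it lets the misspelled "serch" in the second document count as a hit. A production index would not compare the term against every indexed word, so treat the linear scan as a simplification.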
Hit highlighting
Search results are displayed with the search terms or "hits"
highlighted and with the original formatting, links and images
intact. The initial results list can show:
- Document or web page title
- Synopsis, i.e. a brief snippet of text showing the first hits
in context.
- Hit count
- Date and size
To avoid delays, only the page with the first hit highlighted
will be downloaded initially. The remainder will be downloaded as
the user browses through the document.
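As a rough illustration of what a results list entry involves, the following Python sketch wraps hits in highlight tags, counts them, and builds a short synopsis around the first hit. The function names, the `<b>` tags and the `context` parameter are assumptions made for this example; they do not describe how dtSearch itself renders results.

```python
import re


def _pattern(terms):
    """Compile a case-insensitive whole-word pattern for the search terms."""
    alternatives = "|".join(re.escape(t) for t in terms)
    return re.compile(r"\b(" + alternatives + r")\b", re.IGNORECASE)


def highlight(text, terms, open_tag="<b>", close_tag="</b>"):
    """Wrap every hit in the given tags, leaving the rest of the text intact."""
    if not terms:
        return text
    return _pattern(terms).sub(
        lambda m: f"{open_tag}{m.group(0)}{close_tag}", text
    )


def hit_count(text, terms):
    """Number of hits in the text."""
    return len(_pattern(terms).findall(text)) if terms else 0


def synopsis(text, terms, context=3):
    """Return the first hit with `context` words on each side, or '' if no hit."""
    words = text.split()
    wanted = {t.lower() for t in terms}
    for i, word in enumerate(words):
        if word.strip(".,;:!?\"'()").lower() in wanted:
            start = max(0, i - context)
            return " ".join(words[start:i + context + 1])
    return ""


text = "The index stores the location of every word, so searching is fast."
marked = highlight(text, ["index", "fast"])
count = hit_count(text, ["index", "fast"])
snippet = synopsis(text, ["location"], context=2)
```

Here `marked` contains the sentence with "index" and "fast" wrapped in `<b>` tags, and `snippet` is the five-word window around "location".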
Ease of use
Setup is fast and easy - an index and default web search interface
can be set up in just a few minutes:
- Specify what to index, i.e. select the folders, website or
Outlook folder
- Include or exclude specific file types
- For updates to the index, use Windows Task Scheduler and:
  - Choose to index only new or modified documents
  - Remove deleted or missing items from the index
- Specify web interface parameters:
  - Choose the indexes to search
  - Choose to enable a log file to track searches for later
  analysis
  - Choose options to display, e.g. fuzzy searching or synonyms
  - Specify items to include in search results list
  - Choose document display formatting options
- After the search form is generated you can edit the HTML to
customize the appearance or cut and paste the code into other
pages.
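The update options in the steps above (index only new or modified documents, remove deleted or missing items) amount to comparing what the index already knows with what is currently on disk. The Python sketch below shows that comparison under stated assumptions: `scan` and `plan_update` are hypothetical helper names, and modification time is used as the change signal, which may differ from what dtSearch actually checks.

```python
import os


def scan(root, include_exts):
    """Walk a folder and return {path: modified_time} for the included file types.

    include_exts is a set of lowercase extensions such as {".pdf", ".docx"}.
    """
    found = {}
    for folder, _dirs, files in os.walk(root):
        for name in files:
            if os.path.splitext(name)[1].lower() in include_exts:
                path = os.path.join(folder, name)
                found[path] = os.path.getmtime(path)
    return found


def plan_update(indexed, current):
    """Compare the last indexed state with the current state.

    Both arguments map path -> modified_time. Returns (to_index, to_remove):
    new or modified paths to (re)index, and paths that no longer exist.
    """
    to_index = sorted(p for p, mtime in current.items() if indexed.get(p) != mtime)
    to_remove = sorted(p for p in indexed if p not in current)
    return to_index, to_remove


# Hypothetical states: what the index recorded last time vs. what exists now.
indexed = {"a.doc": 1, "b.pdf": 2, "c.txt": 3}
current = {"a.doc": 1, "b.pdf": 5, "d.htm": 1}
to_index, to_remove = plan_update(indexed, current)
```

In this example `b.pdf` has changed and `d.htm` is new, so both would be indexed, while `c.txt` has disappeared and would be removed. A scheduled task would typically run `scan` and then act on the plan.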
Customization
dtSearch offers excellent possibilities for integrating the search
interface into a client's template and for building custom
applications using a .NET developer's programming interface.
Andornot can customize the out-of-the-box layout to your
requirements or can assist you in building more complex
applications. Check out the Search functionality
on our website to see how dtSearch can be integrated with a website
interface.
Supported File and Data Types
Document types and data which can be indexed by dtSearch
include:
- Office formats (word processor, spreadsheet, presentation
documents)
- PDF (supports "image with hidden text" format)
- Email (Outlook and Eudora message stores, including
attachments)
- Web pages (HTML, XML, ASP, ASP.NET, etc.)
- Databases (any database that supports data access, e.g. through
ODBC)
- Unicode support for indexing and searching of non-English
text
- Automatic detection of fields in document summaries, XML, HTML,
PDF etc.
Contact us for more
information.