Language Extension Pack extends dtSearch’s built-in Unicode international language support with customized noise word lists and stemming rules for over 25 European languages. The dtSearch product line includes Unicode support, allowing indexing and searching of the many hundreds of languages supported by the Unicode standard. Supplementing this built-in support, dtSearch’s UK distributor (www.dtsearch.co.uk) provides a European Language Extension Pack, with customized noise word lists and stemming rules (to find different linguistic variations on the same root word) for over two dozen European languages. From a sample Cyrillic white paper authored by dtSearch’s UK distributor: dtSearch “includes a mapping from the Cyrillic ‘i’ to the Latin ‘i’ and thus if you search on web pages spidered by dtSearch you would find all the web pages, irrespective of whether you have a Ukrainian keyboard or a Russian keyboard and make the error of substituting the Latin ‘i’. It is this depth of experience that distinguishes dtSearch from many of the newer entrants to the world of search technology.” More from Cyrillic white paper; More on Language Extension Pack
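The character mapping described in the white paper quote can be illustrated with a minimal Python sketch. This is not dtSearch code or the dtSearch API: the fold table and the names `fold`, `build_index` and `search` are hypothetical, and the toy index only shows the idea of folding the Cyrillic ‘і’ (U+0456) onto the Latin ‘i’ at both indexing and query time.

```python
# Hypothetical fold table (an assumption for illustration, not dtSearch's own):
# Cyrillic 'і' / 'І' (U+0456 / U+0406) are mapped onto Latin 'i' / 'I'.
FOLD = str.maketrans({"\u0456": "i", "\u0406": "I"})


def fold(text: str) -> str:
    """Apply the fold table, then case-fold for case-insensitive matching."""
    return text.translate(FOLD).casefold()


def build_index(docs: dict) -> dict:
    """Build a toy inverted index: folded word -> set of document ids."""
    index = {}
    for doc_id, text in docs.items():
        for word in text.split():
            index.setdefault(fold(word), set()).add(doc_id)
    return index


def search(index: dict, term: str) -> list:
    """Look up a single term after folding it the same way as the index."""
    return sorted(index.get(fold(term), set()))
```

Because the same fold is applied on both sides, a query typed with the Latin ‘i’ and a query typed with the Cyrillic ‘і’ reach the same index entry.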
dtSearch Product Line International Language Support Overview
Encyclopaedia Britannica’s cross-language morphological search plug-in integrates with dtSearch. With a focus on Arabic, Farsi and other Middle Eastern languages, Encyclopaedia Britannica has developed a rich product suite that allows English-speaking users to review and analyze foreign language source data. Components of the product suite include Britannica’s Cross Language Morphological Analysis (BMA), Cross Language Entity Extraction (EntX), and Embedded Translation Layer (ETL). “We are delighted to partner with dtSearch and provide state of the art foreign language solutions for our customers. Britannica’s morphology suite seamlessly integrates with the dtSearch Engine developer APIs, enabling users to use English language queries to search for foreign languages, overcoming morphological complexity and ambiguity. All methods enable smooth and transparent integration, adding Britannica’s language capabilities while maintaining the full range of dtSearch’s flexible search capacity.” More
Basis Technology’s Rosette® Linguistics Platform integration accessible through dtSearch API. The Rosette Linguistics Platform helps applications unlock the meaning of unstructured text by determining the language and identifying the basic linguistic features and structure. Relying on code that is unique to each particular language, Rosette provides highly accurate morphological analysis for Chinese, Japanese, Korean, Arabic and other international languages. “We’re pleased to be working with dtSearch to provide their customers with solutions for enabling multilingual information processing.” More
From Intel® Software Partner Program Success Story with dtSearch: “‘We review performance parameters at every step of data access, data parsing, indexing, searching, and hit-highlighting. Intel® VTune™ Performance Analyzer excels at helping us optimize these processes as part of our development cycle.’” “Using the Intel® Concurrency Checker, available through the Intel® Software Partner Program, [dtSearch Corp.] then tested a dtSearch Engine sample application to simulate high-volume concurrent searching of a single shared index, similar to what might occur on a high-traffic web site. The dtSearch Engine multi-threaded indexed search demo achieved 100 percent parallel time in the Intel Concurrency Checker test, indicating full optimization for multi-core hardware under that test scenario.” “The relationship between Intel and dtSearch stretches back a number of years, helping dtSearch continue to develop in parallel with the evolution of client and server platforms. As a result, the combination of hardware and software generates synergies that deliver excellent performance and other benefits to end-customers, including internal customers at Intel.” More
Sovren teams with dtSearch for comprehensive recruitment developer component suite. A software-components-only firm, Sovren develops and markets a full suite of developer components for the recruitment market. Sovren’s solutions are multilingual and are used in job boards, assessment companies, applicant tracking systems, corporate HR, recruitment firms, research firms, and HRIS/HCM systems worldwide. “The dtSearch Engine is fast, effective and a perfect fit for the recruitment industry. Building on top of the dtSearch Engine APIs, we have added advanced text analysis specifically geared for the recruitment market.” More
dtSearch powers Web-based business intelligence mining library. From asp.netPRO (case study): Cybergroup is a developer of advanced Internet and intranet developer search tools. Cybergroup’s client required a Web-based business intelligence mining library, including Web-based searching. “A single dtSearch index could include both the SQL database and the separate document repository, including searching with all the above advanced search features, ranking capabilities, and hit-highlighted display options.” More
Cybergroup’s dbConnector and dbIndexer serve as a convenient dtSearch / database bridge. dbConnector serves as a developer bridge between dtSearch, ODBC-compliant databases such as SQL, and unstructured document data. For those projects where a developer simply desires to apply the power of dtSearch to database text columns, Cybergroup offers dbIndexer. “dtSearch lives up to its reputation as the most powerful search engine on the market. We found that not only was dtSearch searching powerful, including such features as fuzzy searching and relevance ranking, but dtSearch also supported a wide range of file formats. Combining our add-on products with dtSearch provides a powerful means to bring structure to unstructured information.” More
Cybergroup extends custom dtSearch development tools to focus on Web-based file management. The company provides a comprehensive range of Internet, Intranet, Extranet and Web site solutions and services, with a primary focus on Web to database integration. Its tools feature easy multi-user file management, and multiple levels of security. “Our tools are a set of companion products for the market leading search engine product dtSearch. As developers of custom dtSearch applications, we decided to create products that complement the fine features of dtSearch.” More
dtSearch is a proud sponsor of vNext_OC. Meeting schedule
Quicktionary Engine expands concept searching in dtSearch into multilingual dimensions. With Ligature’s Quicktionary Engine, a search for an English word can automatically retrieve its foreign language equivalents, or vice versa. “Quicktionary Engine takes the power of concept searching in dtSearch, and extends that automatically into multilingual dimensions. Quicktionary Engine translates – with lightning speed – your search term into other languages, and then submits the results along with the original terms into the dtSearch Engine. The result is super–powered international language search.” More
Content Analyst™ Technology platform adds advanced content analysis to dtSearching. Content Analyst is now offering a new Content Analyst Technology platform, combining Content Analyst’s specialized categorization and semantic analytics with the dtSearch Engine’s text and meta data searching. The platform provides extensive OEM customization options for its users. “We’ve combined semantic analytics and multiple language processing techniques with dtSearch’s own keyword, meta data, Boolean, fuzzy and other text search capabilities. The result is a major leap forward in information access technologies.” More
Dev Tool Cafe implements dtSearch. Dev Tool Cafe is a community for developer tool and software component users. “We [at Dev Tool Cafe] now have a fully working, self index maintaining search engine. (Thanks again to dtSearch for a great help file, and product.) What are you waiting for? Go on, use it ... its at the top of every page, helping you search for any content you may be looking for.” More
DevDirect.com uses dtSearch to help match developer product buyers and sellers. DevDirect.com links buyers and sellers of developer products, tools and components. “Accurately searching large amounts of information is at the heart of what Dev Direct does, so selecting a search engine supplier was a task not to be taken lightly ... dtSearch had proven credentials and offered such a rich array of capabilities and implementation styles, that we knew that we would be able to get what we wanted from it ... They didn’t just answer our questions though, they have proactively followed up the issues as we approached go-live, to make sure that everything was going OK - a rare and valuable commitment to customer service!” More
Apress® powers online SuperIndex™, covering the contents of the entire Apress library, with dtSearch. Apress includes hundreds of developer titles. For searching through the entire Apress library, Apress now offers the Apress SuperIndex. “Perfect for finding that snippet of code or reference to some obscure tool, the Apress SuperIndex enables all users to quickly access much needed information.” “Powered by dtSearch, it delivers results instantly.” The SuperIndex offers “instantaneous results” that are “engineered for speed and accuracy.” More
PCNet® nets dtSearch. PCNet, or pcnet-online.com, is the complete online resource for PC users. “I needed a search engine for PCNet. The aforementioned index server was getting more and more cranky and even though I had written custom code for the output I still wasn’t happy with it.” With dtSearch, “I have a world class search engine, all in less than 10 minutes ... Highly recommended.” More
Sherpa Software announces OEM versions of its email archiving and e-discovery product line. The Sherpa product line includes Archive Attender, Mail Attender, and Discovery Attender. The product line covers MS Outlook, MS Exchange, Lotus Notes, as well as other popular data formats. “Sherpa Software welcomes opportunities for third party manufacturers who wish to embed and offer Sherpa’s flagship email archiving and e–discovery technology or products, including an embedded version of the market–leading dtSearch Engine for search.” More
Kaleidosearch adds “out of the box” faceted search to a dtSearch Engine installation. Faceted search, the ability to dynamically filter search results by attributes, has remained out of reach for many organizations looking to add enhanced search options to their e-commerce stores, content-rich websites and other applications. Contegra Systems, designer of award-winning sites for Fortune 500 companies and other major online publishers, has now developed Kaleidosearch. Kaleidosearch lets customers integrate faceted search with a wide range of data sources for plug and play operation. Built on the dtSearch Engine, “the market-leading search platform,” Kaleidosearch “faceted search leverages your metadata to ensure successful search results and to eliminate dead-end searches.” Contegra “chose to partner with dtSearch after more than 10 years’ experience working with their API. We believe dtSearch offers far and away the best functionality and value in the market.” More
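Faceted search as defined above (dynamically filtering search results by attributes) can be sketched in a few lines of Python. This is a generic illustration under stated assumptions, not Kaleidosearch or dtSearch Engine code: search results are assumed to be plain dictionaries of metadata, and `facet_counts` and `apply_facet` are hypothetical helper names.

```python
from collections import Counter


def facet_counts(results: list, field: str) -> Counter:
    """Count how many results carry each value of one metadata field."""
    return Counter(r[field] for r in results if field in r)


def apply_facet(results: list, field: str, value) -> list:
    """Keep only the results whose metadata field equals the chosen value."""
    return [r for r in results if r.get(field) == value]
```

A user interface would typically display the counts next to each attribute value and call the filter when a value is selected. Only values that occur in the current result set are offered, which is how facets avoid dead-end searches.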
dtSearch won a People’s .NET Choice Award, Voted “Best Product” for Components-Search — MSD2D.com
Announcing new developer connector libraries: AccessData offers dtSearch Engine users OEM connectors covering Microsoft Exchange, Symantec Enterprise Vault (Exchange), Oracle URM, and Lotus Notes (Domino Server). AccessData has taken its unparalleled expertise in the fields of computer forensics, cyber security and e-discovery, and used it to develop flexible and robust data connectors for a wide array of content repositories. AccessData connectors are extremely lightweight and accessible through .NET APIs. Each connector offers the ability to: iterate over the structure of the remote data store; retrieve incremental indexing information (IDs and timestamps); retrieve item metadata; retrieve item content; and store data in a dtSearch index. “These data connectors are purpose-built to integrate with dtSearch technology and allow for full text indexing of a repository’s content. These indexes can then be used to support a wide variety of business solutions including records management, information governance, internal investigations, audit and e-discovery.” More
I-Programmer adds articles on threading, hit highlighting, database indexing and caching to its getting started with dtSearch and C# article. “I’m investigating dtSearch and I can tell you now, it’s a refreshing return to simplicity ... You can use dtSearch from any .NET language, Java or C++. In this case I’m going to use the .NET API and C# 4.0, but the ideas are more or less the same in any language because the same classes are provided to do the same job ... Yes it really is this easy.” More (“Getting Started with dtSearch” using C# at I-Programmer) “Although the main conclusion has to be that this is a really easy to use system, there are always considerations about how to do things in a slightly more sophisticated way. In this article we take a look at how to deal with big searches and the sorts of things you can do with what you find.” More (“Threading and dtSearch” using C# at I-Programmer) “While the output of the conversion is HTML (or RTF, or XML or plain text) the input file can be in any of the supported formats.” “A really nice touch is the provision of the special tags ... which causes the FileConverter to place numbers into the file corresponding to the index of the hit in the array.” More (“Hit highlighting with dtSearch” using C# at I-Programmer) “Take the same DataSource class and customize it to provide documents or raw text from any source you care to use – ODB, ADO.NET, LINQ, raw SQL, XML, RSS or any of the many web APIs.” More (“Full Text Database Indexing with dtSearch”) “So far we have ignored one intriguing option” with dtSearch: “you can opt to store the contents of a document within the index” where “the document to be cached could be stored on a website, a cloud datastore or a file share.” “There are so many way to use the text or files cached in the index,” including “to generate a search report using the cache.” Here, “the report consists of all of the hits, with the hits highlighted using bold and an additional 10 words of context surrounding the hit.” More (“Document Caching with dtSearch”)
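The kind of report described in the last quote (every hit in bold, with 10 words of context on each side) can be approximated with a short Python sketch. It does not use the dtSearch API or its document cache: the function name `hit_report`, the `<b>` markup, and the simple whitespace tokenization are all assumptions made for illustration.

```python
def hit_report(text: str, term: str, context: int = 10) -> list:
    """Return one line per hit: the hit wrapped in <b>...</b> plus up to
    `context` words before and after it (whitespace tokenization only)."""
    words = text.split()
    wanted = term.casefold()
    lines = []
    for i, word in enumerate(words):
        # Ignore surrounding punctuation and letter case when comparing.
        if word.strip('.,;:!?"()').casefold() == wanted:
            before = words[max(0, i - context):i]
            after = words[i + 1:i + 1 + context]
            lines.append(" ".join(before + ["<b>" + word + "</b>"] + after))
    return lines
```

For example, `hit_report("the quick brown fox jumps", "fox", context=1)` returns `["brown <b>fox</b> jumps"]`.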
EggHeadCafe is “the .NET developer’s portal of choice.” From EggHeadCafe Newsletter: “We migrated our search capabilities over to dtSearch ... This has improved the accuracy of search queries as well as given us much better search result summaries.” More
Pinpoint Labs’ SafeCopy 2 integrates with dtSearch products to provide forensically-sound electronically stored information (ESI) collection for dtSearch users. Pinpoint Labs specializes in computer forensics software and services. The company’s SafeCopy 2 integrates with dtSearch for ESI chain of custody handling of retrieved files. dtSearch provides “a great way to do text searches” and “can be used effectively to create file lists that can be then used by SafeCopy to collect those files ... a lot of programs already have dtSearch integrated into them.” For details on forensically-sound ESI file collection through SafeCopy for dtSearch users, please visit Pinpoint’s Webinar presentation.
Bitext integrates dtSearch with NaturalFinder linguistic analysis suite. Bitext develops linguistic technologies for natural language understanding in different languages, including Spanish and English. Its NaturalFinder product suite integrates with the dtSearch Engine to provide enhanced linguistic analysis in web-based and other environments. NaturalFinder also includes DataNet, which expedites semantic relations management, and DataSpell, which detects spelling and typographic errors and suggests the correct search query. “We liked the quality of dtSearch’s documentation and code samples. The readiness of its technical support service made integration a simple and low-cost task.” More