This project is read-only.


Rating:        Based on 1 rating
Reviewed:  1 review
Downloads: 2620
Change Set: c1bae4d32ab9
Released: Jun 7, 2013
Updated: Jun 16, 2013 by IvanAkcheurov
Dev status: Beta Help Icon

Recommended Download

Application Binaries
application, 6344K, uploaded Jun 7, 2013 - 2620 downloads

Release Notes


  • Updated the version
  • Pushed NTextCat to NuGet
    • Please don't forget to include a language profile in your delivery (with your exe, website, etc.). It is located here: <YOURSOLUTION>\packages\IvanAkcheurov.NTextCat.Lib.\Core14.profile.xml
  • Minor restructuring of the solution

NTextCat 0.2.1

  • Recommended length of a text snippet has been reduced to 5 (though mostly a single word is handled correctly).
  • Much better support for Asian languages (Chinese, Japanese).
  • Simplified and made more consistent API. (examples of usage in unit tests)

// Don't forget to deploy a language profile (e.g. Core14.profile.xml) with your application.
// You can do it with referencing a language profile as content that should be copied to output
// in your application project (.exe, website, etc.).
// Language profiles are located either in <NTextCatRelease>\LanguageModels\
// or <YOURSOLUTION>\packages\IvanAkcheurov.NTextCat.Lib.\ (if you get NTextCat from NuGet)
var factory = new RankedLanguageIdentifierFactory();
var identifier = factory.Load("Core14.profile.xml");
var res = identifier.Identify("your text to get its language identified");
  • Fixed NaiveBayesLanguageIdentifier so that it performs as good as RankedLanguageIdentifier
  • NTextCat.exe provides the main command line interface from now on (it's command line API may be changed in several subsequent releases).
  • Based on the feedback, a set of 14 the most popular languages has been selected. It has become a default. The set: Chinese, Danish, Dutch, English, French, German, Italian, Japanese, Korean, Norwegian, Portuguese, Russian, Spanish, Swedish
  • SqlServerClrIntegration is not in the release yet. It will be reintroduced in one of the next releases recompiled and verified for SQL Server 2012.
  • Fixed a bug in GaussianBag
  • More rigid testing routines as preparations to produce a stable release.

Reviews for this release

Great library!
by ShreddedSoul on Dec 23, 2013 at 6:52 AM