MeTA v2.1.0 Release Notes

Release Date: 2016-02-13 // about 8 years ago
  • ๐Ÿ†• New features

    • โž• Add the GloVe algorithm for
      training word embeddings and a library class word_embeddings for loading and
      querying trained embeddings. To facilitate returning word embeddings, a simple
      util::array_view class was added.
    • โž• Add simple vector math library (and move fastapprox into the math
      namespace).

    ๐Ÿ› Bug fixes

    • Fix probe_map::extract() for inline_key_value_storage type; old
      implementation forgot to delete all sentinel values before returning the
      vector.
    • ๐Ÿ›  Fix incorrect definition of l1norm() in sgd_model.
    • ๐Ÿ›  Fix gmap calculation where 0 average precision was ignored
    • ๐Ÿ›  Fix progress output in multiway_merge.

    โœจ Enhancements

    • ๐Ÿ‘Œ Improve performance of printing::progress. Before, progress::operator() in
      ๐ŸŽ tight loops could dramatically hurt performance, particularly due to frequent
      calls to std::chrono::steady_clock::now(). Now, progress::operator()
      simply sets an atomic iteration counter and a background thread periodically
      โšก๏ธ wakes to update the progress output.
    • ๐Ÿ‘ Allow full text storage in index as metadata field. If store-full-text = true (default false) in the corpus config, the string metadata field
      "content" will be added. This is to simplify the creation of full text
      ๐Ÿ“‡ metadata: the user doesn't have to duplicate their dataset in metadata.dat,
      ๐Ÿ“‡ and metadata.dat will still be somewhat human-readable without large strings
      of full text added.
    • ๐Ÿ‘ Allow make_index to take a user-supplied corpus object.

    Miscellaneous

    • ZLIB is now a required dependency.
    • โœ… Switch to just using the standalone ./unit-test instead of ctest. There
      โœ… aren't really many advantages for us to using CTest at this point with the new
      โœ… unit test framework, so just use our unit test executable.