BigARTM
  • Introduction
  • Downloads
  • Tutorials
    • Basic BigARTM tutorial for Windows users
      • Download
      • Running BigARTM from command line
      • Configure BigARTM Python API
      • Running BigARTM from Python API
    • Basic BigARTM tutorial for Linux and Mac OS-X users
      • Download sources and build
      • Running BigARTM from command line
      • Configure BigARTM Python API
      • Troubleshooting
      • BigARTM on Travis-CI
    • Typical python example
      • Parse collection
      • MasterComponent
      • Configure Topic Model
      • Invoke Iterations
      • Retrieve and visualize scores
  • Whitepapers
    • Enabling Basic BigARTM Regularizers
    • BigARTM as a Service
  • BigARTM FAQ
    • Can I use BigARTM from other programming languages (not Python)?
    • How to retrieve Theta matrix from BigARTM
  • BigARTM Developer’s Guide
    • Downloads (Windows)
    • Source code
    • Build C++ code on Windows
    • Python code on Windows
    • Build C++ code on Linux
    • Compiling .proto files on Windows
    • Code style
    • Intel Math Kernel Library
  • BigARTM Reference
    • BigARTM: The Algorithm Under The Hood
    • Messages
      • DoubleArray
      • FloatArray
      • BoolArray
      • IntArray
      • Item
      • Field
      • Batch
      • Stream
      • MasterComponentConfig
      • ModelConfig
      • RegularizerConfig
      • SmoothSparseThetaConfig
      • SmoothSparsePhiConfig
      • DecorrelatorPhiConfig
      • LabelRegularizationPhiConfig
      • RegularizerInternalState
      • DictionaryConfig
      • DictionaryEntry
      • ScoreConfig
      • ScoreData
      • PerplexityScoreConfig
      • PerplexityScore
      • SparsityThetaScoreConfig
      • SparsityThetaScore
      • SparsityPhiScoreConfig
      • SparsityPhiScore
      • ItemsProcessedScoreConfig
      • ItemsProcessedScore
      • TopTokensScoreConfig
      • TopTokensScore
      • ThetaSnippetScoreConfig
      • ThetaSnippetScore
      • TopicKernelScoreConfig
      • TopicKernelScore
      • TopicModel
      • ThetaMatrix
      • CollectionParserConfig
      • SynchronizeModelArgs
      • InitializeModelArgs
      • GetTopicModelArgs
      • GetThetaMatrixArgs
      • GetScoreValueArgs
      • AddBatchArgs
      • InvokeIterationArgs
      • WaitIdleArgs
      • ExportModelArgs
      • ImportModelArgs
    • Python Interface
      • Library
      • MasterComponent
      • Model
      • Regularizer
      • Score
      • Dictionary
      • Visualizers
      • Exceptions
      • Constants
    • Plain C interface of BigARTM
      • Introduction
      • ArtmCreateMasterComponent
      • ArtmReconfigureMasterComponent
      • ArtmDisposeMasterComponent
      • ArtmCreateModel
      • ArtmReconfigureModel
      • ArtmDisposeModel
      • ArtmCreateRegularizer
      • ArtmReconfigureRegularizer
      • ArtmDisposeRegularizer
      • ArtmCreateDictionary
      • ArtmReconfigureDictionary
      • ArtmDisposeDictionary
      • ArtmAddBatch
      • ArtmInvokeIteration
      • ArtmSynchronizeModel
      • ArtmInitializeModel
      • ArtmExportModel
      • ArtmImportModel
      • ArtmWaitIdle
      • ArtmOverwriteTopicModel
      • ArtmRequestThetaMatrix
      • ArtmRequestTopicModel
      • ArtmRequestRegularizerState
      • ArtmRequestScore
      • ArtmRequestParseCollection
      • ArtmRequestLoadDictionary
      • ArtmRequestLoadBatch
      • ArtmCopyRequestResult
      • ArtmSaveBatch
      • ArtmGetLastErrorMessage
      • Error codes
    • C++ interface
      • MasterComponent
      • Model
      • Regularizer
      • Dictionary
      • Utility methods
    • BigARTM command line utility
    • Windows distribution
  • Release Notes
    • BigARTM v0.7.0 Release notes
      • New-style models
      • Network modus operandi is removed
      • Coherence regularizer and scores (experimental)
    • BigARTM v0.7.1 Release notes
      • BigARTM notebooks
      • ArtmModel
      • Coding Phi-regularizers in Python code
      • Other changes
      • Breaking changes
  • Publications
  • Legacy documentation pages
    • Tutorial
 
BigARTM
  • Docs »
  • Downloads
  • Edit on GitHub

DownloadsΒΆ

  • Windows

    • Latest 32 bit release: https://github.com/bigartm/bigartm/releases/download/v0.7.1/BigARTM_v0.7.1_win32.7z
    • Latest 64 bit release: https://github.com/bigartm/bigartm/releases/download/v0.7.1/BigARTM_v0.7.1_x64.7z
    • All previous releases are available at https://github.com/bigartm/bigartm/releases

    Please refer to Basic BigARTM tutorial for Windows users for step by step installation procedure.

  • Linux, Mac OS-X

    To run BigARTM on Linux and Mac OS-X you need to clone BigARTM repository (https://github.com/bigartm/bigartm) and build it as described in Basic BigARTM tutorial for Linux and Mac OS-X users.

  • Datasets

    Task Source #Words #Items Files
    kos UCI 6906 3430
    • docword.kos.txt.gz (1 MB)
    • vocab.kos.txt (54 KB)
    • kos_1k (700 KB)
    • kos_dictionary
    nips UCI 12419 1500
    • docword.nips.txt.gz (2.1 MB)
    • vocab.nips.txt (98 KB)
    • nips_200 (1.5 MB)
    • nips_dictionary
    enron UCI 28102 39861
    • docword.enron.txt.gz (11.7 MB)
    • vocab.enron.txt (230 KB)
    • enron_1k (7.1 MB)
    • enron_dictionary
    nytimes UCI 102660 300000
    • docword.nytimes.txt.gz (223 MB)
    • vocab.nytimes.txt (1.2 MB)
    • nytimes_1k (131 MB)
    • nytimes_dictionary
    pubmed UCI 141043 8200000
    • docword.pubmed.txt.gz (1.7 GB)
    • vocab.pubmed.txt (1.3 MB)
    • pubmed_10k (1 GB)
    • pubmed_dictionary
    wiki Gensim 100000 3665223
    • enwiki-20141208_10k (1.2 GB)
    • enwiki-20141208_1k (1.4 GB)
    • enwiki-20141208_dictionary (3.6 MB)
    wiki_enru Wiki 196749 216175
    • wiki_enru (282 MB)
    • wiki_enru_dictionary (5.3 MB)
    • class_id(s): @english, @russian
Next Previous

© Copyright 2015, Konstantin Vorontsov.

Built with Sphinx using a theme provided by Read the Docs.
Read the Docs v: v0.7.1
Versions
latest
stable
v0.7.1
v0.7.0
v0.6.4
v0.6.3
v0.6.2
v0.6.1
v0.6.0
v0.5.9
v0.5.8
v0.5.7
v0.5.6
v0.5.5
v0.5.4
Downloads
On Read the Docs
Project Home
Builds

Free document hosting provided by Read the Docs.