`markov_text` - A Text Generator Based on Higher-Order Markov Chains

Installing and Building

1. Install CMake (version 3.11 or later).
2. In this directory, run `cmake -B build`.
3. After step 2 finishes, run `cmake --build build`.
4. Everything should be done now!

Running

Run `./build/markov_text -h` for help.

An example usage is given below. First, construct the chain:

./build/markov_text -c corpus -O 3 -o out

This constructs an order-3 Markov chain from the large text file `corpus` and saves it as four files whose names start with `out`. Note that `-O 3` (order 3) and `-o out` (output file path `out`) are the defaults and can be omitted, so `./build/markov_text -c corpus` is equivalent to the command above.
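As a rough illustration of what constructing an order-k chain involves, the sketch below maps every run of `order` consecutive tokens to the tokens that followed it in the corpus. It is not the tool's actual implementation: the names `tokenize`, `build_chain`, `Context` and `Chain` are hypothetical, whitespace tokenization is an assumption, and the four-file on-disk format is not modelled at all.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical types: a context is `order` consecutive tokens, and the chain
// maps each context to every token observed directly after it.
using Context = std::vector<std::string>;
using Chain = std::map<Context, std::vector<std::string>>;

// Split text on whitespace (an assumed tokenization, not necessarily the tool's).
std::vector<std::string> tokenize(const std::string& text) {
    std::istringstream in(text);
    std::vector<std::string> tokens;
    std::string token;
    while (in >> token) tokens.push_back(token);
    return tokens;
}

// Record, for each window of `order` tokens, the token that came next.
// Inputs with no more than `order` tokens yield an empty chain.
Chain build_chain(const std::vector<std::string>& tokens, std::size_t order) {
    assert(order >= 1);
    Chain chain;
    for (std::size_t i = 0; i + order < tokens.size(); ++i) {
        Context context(tokens.begin() + i, tokens.begin() + i + order);
        chain[context].push_back(tokens[i + order]);
    }
    return chain;
}
```

For example, `build_chain(tokenize("a b c a b d"), 2)` yields three contexts, and the context `a b` maps to both `c` and `d`. Keeping repeated successors in the list means a later uniform pick from that list is automatically weighted by how often each successor occurred.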

Then to generate text, run:

./build/markov_text -g out -s 100

This generates at most 100 tokens from the chain stored in the files whose names start with `out`. Note that `-s 100` (generate at most 100 tokens) is the default and can be omitted, so `./build/markov_text -g out` is equivalent to the command above.
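The generation step can be pictured as a loop that repeatedly samples a successor of the current context and slides the context forward by one token. The sketch below is a minimal version under stated assumptions: `generate`, `Context` and `Chain` are hypothetical names, the caller supplies the starting context, and the starting tokens count towards the limit. How the real tool picks its starting state and counts tokens may differ.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <random>
#include <string>
#include <vector>

// Hypothetical types: each context maps to the tokens observed after it.
using Context = std::vector<std::string>;
using Chain = std::map<Context, std::vector<std::string>>;

// Emit the starting context, then keep sampling successors until either
// `max_tokens` tokens exist or the current context has no recorded successor.
std::vector<std::string> generate(const Chain& chain, const Context& start,
                                  std::size_t max_tokens, std::mt19937& rng) {
    assert(!start.empty());
    std::vector<std::string> output(start);
    if (output.size() > max_tokens) output.resize(max_tokens);
    Context context = start;
    while (output.size() < max_tokens) {
        const auto it = chain.find(context);
        if (it == chain.end() || it->second.empty()) break;  // no next state: stop early
        const std::vector<std::string>& successors = it->second;
        std::uniform_int_distribution<std::size_t> pick(0, successors.size() - 1);
        const std::string next = successors[pick(rng)];
        output.push_back(next);
        // Slide the context window forward by one token.
        context.erase(context.begin());
        context.push_back(next);
    }
    return output;
}
```

With an order-1 chain holding only the transitions `a` to `b` and `b` to `c`, starting from `a` and requesting 100 tokens returns just `a b c`, because `c` has no successor. Requesting 2 tokens from the same chain returns `a b`.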

Notes

- Behaviour is undefined if the input file has fewer tokens than the order of the constructed chain.
- The generator produces "at most" N tokens because generation ends when the Markov chain has no next state. This can happen when the current sequence of tokens is a unique sequence that appears at the end of the input text file. To reproduce this, create a file with K unique tokens and then request N > K tokens; at most K tokens will be produced.
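Both notes can be checked on an already tokenized input before building anything. The sketch below is a hypothetical pre-flight check, not part of the tool: `has_enough_tokens` covers the first note, and `ends_in_dead_end` reports whether the final `order` tokens never occur earlier in the input followed by another token, which is the situation in which generation can stop early.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// True when the input is long enough to contain at least one full context.
bool has_enough_tokens(const std::vector<std::string>& tokens, std::size_t order) {
    return tokens.size() >= order;
}

// True when the final `order` tokens never occur earlier in the input
// followed by a token, so a generator that reaches them has no next state.
bool ends_in_dead_end(const std::vector<std::string>& tokens, std::size_t order) {
    assert(order >= 1 && has_enough_tokens(tokens, order));
    const auto tail = tokens.end() - static_cast<std::ptrdiff_t>(order);
    // Search only the prefix in which a match would still have a successor.
    const auto hit = std::search(tokens.begin(), tokens.end() - 1, tail, tokens.end());
    return hit == tokens.end() - 1;  // std::search returns the range end when nothing matches
}
```

For the input `a b c` at order 1 the check reports a dead end, since `c` appears only as the last token. For `a b a` it does not, because `a` also appears earlier with `b` after it.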

Contributions

Contributions and feedback are more than welcome!
