diff --git a/README.md b/README.md
index e2ad0612..93bde2c5 100644
--- a/README.md
+++ b/README.md
@@ -70,6 +70,7 @@ Please take a quick gander at the [contribution guidelines](https://github.com/a
 ## Contents
 
 - [Awesome Go](#awesome-go)
+  - [Contents](#contents)
   - [Audio and Music](#audio-and-music)
   - [Authentication and OAuth](#authentication-and-oauth)
   - [Blockchain](#blockchain)
@@ -1609,12 +1610,14 @@ _Libraries for working with human languages._
 See also [Text Processing](#text-processing).
 
 ### Language Detection
+
 - [detectlanguage](https://github.com/detectlanguage/detectlanguage-go) - Language Detection API Go Client. Supports batch requests, short phrase or single word language detection.
 - [getlang](https://github.com/rylans/getlang) - Fast natural language detection package.
 - [guesslanguage](https://github.com/endeveit/guesslanguage) - Functions to determine the natural language of a unicode text.
 - [whatlanggo](https://github.com/abadojack/whatlanggo) - Natural language detection package for Go. Supports 84 languages and 24 scripts (writing systems e.g. Latin, Cyrillic, etc).
 
 ### Morphological Analyzers
+
 - [go-stem](https://github.com/agonopol/go-stem) - Implementation of the porter stemming algorithm.
 - [go2vec](https://github.com/danieldk/go2vec) - Reader and utility functions for word2vec embeddings.
 - [golibstemmer](https://github.com/rjohnsondev/golibstemmer) - Go bindings for the snowball libstemmer library including porter 2.
@@ -1632,11 +1635,13 @@ See also [Text Processing](#text-processing).
 - [spaGO](https://github.com/nlpodyssey/spago) - Self-contained Machine Learning and Natural Language Processing library in Go.
 
 ### Slugifiers
+
 - [go-slugify](https://github.com/mozillazg/go-slugify) - Make pretty slug with multiple languages support.
 - [slug](https://github.com/gosimple/slug) - URL-friendly slugify with multiple languages support.
 - [Slugify](https://github.com/avelino/slugify) - Go slugify application that handles string.
 
 ### Tokenizers
+
 - [gojieba](https://github.com/yanyiwu/gojieba) - This is a Go implementation of [jieba](https://github.com/fxsjy/jieba) which a Chinese word splitting algorithm.
 - [gotokenizer](https://github.com/xujiajun/gotokenizer) - A tokenizer based on the dictionary and Bigram language models for Golang. (Now only support chinese segmentation)
 - [gse](https://github.com/go-ego/gse) - Go efficient text segmentation; support english, chinese, japanese and other.
@@ -1649,6 +1654,7 @@ See also [Text Processing](#text-processing).
 - [textcat](https://github.com/pebbe/textcat) - Go package for n-gram based text categorization, with support for utf-8 and raw text.
 
 ### Translation
+
 - [go-i18n](https://github.com/nicksnyder/go-i18n/) - Package and an accompanying tool to work with localized text.
 - [go-localize](https://github.com/m1/go-localize) - Simple and easy to use i18n (Internationalization and localization) engine - used for translating locale strings.
 - [go-mystem](https://github.com/dveselov/mystem) - CGo bindings to Yandex.Mystem - russian morphology analyzer.
@@ -1659,6 +1665,7 @@ See also [Text Processing](#text-processing).
 - [t](https://github.com/youthlin/t) - Another i18n pkg for golang, which follows GNU gettext style and supports .po/.mo files: `t.T (gettext)`, `t.N (ngettext)`, etc. And it contains a cmd tool [xtemplate](https://github.com/youthlin/t/blob/main/cmd/xtemplate), which can extract messages as a pot file from text/html template.
 
 ### Transliteration
+
 - [enca](https://github.com/endeveit/enca) - Minimal cgo bindings for [libenca](https://cihar.com/software/enca/), which detects character encodings.
 - [go-unidecode](https://github.com/mozillazg/go-unidecode) - ASCII transliterations of Unicode text.
 - [gounidecode](https://github.com/fiam/gounidecode) - Unicode transliterator (also known as unidecode) for Go.
@@ -2160,6 +2167,7 @@ _Libraries for parsing and manipulating texts._
 See also [Natural Language Processing](#natural-language-processing).
 
 ### Formatters
+
 - [address](https://github.com/bojanz/address) - Handles address representation, validation and formatting.
 - [align](https://github.com/Guitarbum722/align) - A general purpose application that aligns text.
 - [bytes](https://github.com/labstack/gommon/tree/master/bytes) - Formats and parses numeric byte values (10K, 2M, 3G, etc.).
@@ -2169,6 +2177,7 @@ See also [Natural Language Processing](#natural-language-processing).
 - [textwrap](https://github.com/isbm/textwrap) - Wraps text at end of lines. Implementation of `textwrap` module from Python.
 
 ### Markup Languages
+
 - [bafi](https://github.com/mmalcek/bafi) - Universal JSON, BSON, YAML, XML translator to ANY format using templates.
 - [bbConvert](https://github.com/CalebQ42/bbConvert) - Converts bbCode to HTML that allows you to add support for custom bbCode tags.
 - [blackfriday](https://github.com/russross/blackfriday) - Markdown processor in Go.
@@ -2182,6 +2191,7 @@ See also [Natural Language Processing](#natural-language-processing).
 - [toml](https://github.com/BurntSushi/toml) - TOML configuration format (encoder/decoder with reflection).
 
 ### Parsers/Encoders/Decoders
+
 - [allot](https://github.com/sbstjn/allot) - Placeholder and wildcard text parsing for CLI tools and bots.
 - [codetree](https://github.com/aerogo/codetree) - Parses indented code (python, pixy, scarlet, etc.) and returns a tree structure.
 - [commonregex](https://github.com/mingrammer/commonregex) - A collection of common regular expressions for Go.
@@ -2205,16 +2215,19 @@ See also [Natural Language Processing](#natural-language-processing).
 - [xj2go](https://github.com/stackerzzq/xj2go) - Convert xml or json to go struct.
 
 ### Regular Expressions
+
 - [genex](https://github.com/alixaxel/genex) - Count and expand Regular Expressions into all matching Strings.
 - [go-wildcard](https://github.com/IGLOU-EU/go-wildcard) - Simple and lightweight wildcard pattern matching.
 - [goregen](https://github.com/zach-klippenstein/goregen) - Library for generating random strings from regular expressions.
 - [regroup](https://github.com/oriser/regroup) - Match regex expression named groups into go struct using struct tags and automatic parsing.
 
 ### Sanitation
+
 - [bluemonday](https://github.com/microcosm-cc/bluemonday) - HTML Sanitizer.
 - [gofuckyourself](https://github.com/JoshuaDoes/gofuckyourself) - A sanitization-based swear filter for Go.
 
 ### Scrapers
+
 - [colly](https://github.com/asciimoo/colly) - Fast and Elegant Scraping Framework for Gophers.
 - [dataflowkit](https://github.com/slotix/dataflowkit) - Web scraping Framework to turn websites into structured data.
 - [GoQuery](https://github.com/PuerkitoBio/goquery) - GoQuery brings a syntax and a set of features similar to jQuery to the Go language.
@@ -2224,10 +2237,12 @@ See also [Natural Language Processing](#natural-language-processing).
 - [xurls](https://github.com/mvdan/xurls) - Extract urls from text.
 
 ### RSS
+
 - [podcast](https://github.com/eduncan911/podcast) - iTunes Compliant and RSS 2.0 Podcast Generator in Golang
 - [syndfeed](https://github.com/zhengchun/syndfeed) - A syndication feed for Atom 1.0 and RSS 2.0.
 
 ### Utility/Miscellaneous
+
 - [go-runewidth](https://github.com/mattn/go-runewidth) - Functions to get fixed width of the character or string.
 - [go-zero-width](https://github.com/trubitsyn/go-zero-width) - Zero-width character detection and removal for Go.
 - [kace](https://github.com/codemodus/kace) - Common case conversions covering common initialisms.