You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- All `#TODO` comments should be turned into [issues](https://github.com/pythainlp/pythainlp/issues) in GitHub;
29
-
- When appropriate, use [f-String](https://www.python.org/dev/peps/pep-0498/)
29
+
- When appropriate, use [f-string](https://www.python.org/dev/peps/pep-0498/)
30
30
(use `f"{a} = {b}"`, instead of `"{} = {}".format(a, b)` and `"%s = %s' % (a, b)"`);
31
-
- All text files, including source code, must be ended with one empty line. This is [to please git](https://stackoverflow.com/questions/5813311/no-newline-at-end-of-file#5813359) and [to keep up with POSIX standard](https://stackoverflow.com/questions/729692/why-should-text-files-end-with-a-newline).
31
+
- All text files, including source codes, must end with one empty line. This is [to please git](https://stackoverflow.com/questions/5813311/no-newline-at-end-of-file#5813359) and [to keep up with POSIX standard](https://stackoverflow.com/questions/729692/why-should-text-files-end-with-a-newline).
32
32
33
33
### Version Control System
34
34
35
35
- We use [Git](http://git-scm.com/) as our [version control system](http://en.wikipedia.org/wiki/Revision_control),
36
36
so it may be a good idea to familiarize yourself with it.
37
37
- You can start with the [Pro Git book](http://git-scm.com/book/) (free!).
38
38
39
-
### Commit Comment
39
+
### Commit Message
40
40
41
41
-[How to Write a Git Commit Message](https://chris.beams.io/posts/git-commit/)
42
42
-[Commit Verbs 101: why I like to use this and why you should also like it.](https://chris.beams.io/posts/git-commit/)
@@ -45,24 +45,24 @@ so it may be a good idea to familiarize yourself with it.
45
45
46
46
- We use the famous [gitflow](http://nvie.com/posts/a-successful-git-branching-model/)
47
47
to manage our branches.
48
-
- When you do pull request on GitHub, Travis CI and AppVeyor will run tests
48
+
- When you create pull requests on GitHub, Github Actions and AppVeyor will run tests
49
49
and several checks automatically. Click the "Details" link at the end of
50
50
each check to see what needs to be fixed.
51
51
52
52
53
53
## Documentation
54
54
55
55
- We use [Sphinx](https://www.sphinx-doc.org/en/master/) to generate API document
56
-
automatically from "docstring" comments in source code. This means the comment
57
-
section in the source code is important for the quality of documentation.
58
-
- A docstring should start with one summary line, ended the line with a full stop (period),
59
-
then followed by a blank line before the start new paragraph.
60
-
- A commit to release branches (e.g. `2.2`, `2.1`) with a title **"(build and deploy docs)"** (without quotes) will trigger the system to rebuild the documentation files and upload them to the website https://pythainlp.github.io/docs
56
+
automatically from "docstring" comments in source codes. This means the comment
57
+
section in the source codes is important for the quality of documentation.
58
+
- A docstring should start with one summary line, end with one line with a full stop (period),
59
+
then be followed by a blank line before starting a new paragraph.
60
+
- A commit to release branches (e.g. `2.2`, `2.1`) with a title **"(build and deploy docs)"** (without quotes) will trigger the system to rebuild the documentation files and upload them to the website https://pythainlp.github.io/docs.
61
61
62
62
63
63
## Testing
64
64
65
-
We use standard Python `unittest`. Test suites are in `tests/` directory.
65
+
We use standard Python `unittest`. The test suite is in `tests/` directory.
66
66
67
67
To run unit tests locally together with code coverage test:
68
68
@@ -81,12 +81,12 @@ Generate code coverage test in HTML (files will be available in `htmlcov/` direc
81
81
coverage html
82
82
```
83
83
84
-
Make sure the same tests pass on Travis CI and AppVeyor.
84
+
Make sure the tests pass on both Github Actions and AppVeyor.
85
85
86
86
87
87
## Releasing
88
88
- We use [semantic versioning](https://semver.org/): MAJOR.MINOR.PATCH, with development build suffix: MAJOR.MINOR.PATCH-devBUILD
89
-
-Use[`bumpversion`](https://github.com/c4urself/bump2version/#installation) to manage versioning.
89
+
-We use[`bumpversion`](https://github.com/c4urself/bump2version/#installation) to manage versioning.
90
90
-`bumpversion [major|minor|patch|release|build]`
91
91
- Example:
92
92
```
@@ -129,18 +129,18 @@ Make sure the same tests pass on Travis CI and AppVeyor.
Thanks all the[contributors](https://github.com/PyThaiNLP/pythainlp/graphs/contributors). (Image made with [contributors-img](https://contributors-img.firebaseapp.com))
132
+
Thanks to all[contributors](https://github.com/PyThaiNLP/pythainlp/graphs/contributors). (Image made with [contributors-img](https://contributors-img.firebaseapp.com))
133
133
134
-
### Development Lead
135
-
- Wannaphong Phatthiyaphaibun <[email protected]> - founder, distribution and maintainance
136
-
- Korakot Chaovavanich - initial tokenization and soundex code
134
+
### Development Leads
135
+
- Wannaphong Phatthiyaphaibun <[email protected]> - foundation, distribution and maintenance
136
+
- Korakot Chaovavanich - initial tokenization and soundex codes
137
137
- Charin Polpanumas - classification and benchmarking
138
138
- Peeradej Tanruangporn - documentation
139
-
- Arthit Suriyawongkul - refactoring, packaging, distribution, and maintainance
139
+
- Arthit Suriyawongkul - refactoring, packaging, distribution, and maintenance
Copy file name to clipboardExpand all lines: README.md
+21-21Lines changed: 21 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,13 +13,13 @@
13
13
<ahref="https://matrix.to/#/#thainlp:matrix.org"rel="noopener"target="_blank"><imgsrc="https://matrix.to/img/matrix-badge.svg"alt="Chat on Matrix"></a>
14
14
</div>
15
15
16
-
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to [NLTK](https://www.nltk.org/) with focus on Thai language.
16
+
PyThaiNLP is a Python package for text processing and linguistic analysis, similar to [NLTK](https://www.nltk.org/) with a focus on the Thai language.
> Now, You can contact or ask any questions with the PyThaiNLP team. <ahref="https://matrix.to/#/#thainlp:matrix.org"rel="noopener"target="_blank"><imgsrc="https://matrix.to/img/matrix-badge.svg"alt="Chat on Matrix"></a>
22
+
> Now, You can contact with or ask any questions of the PyThaiNLP team. <ahref="https://matrix.to/#/#thainlp:matrix.org"rel="noopener"target="_blank"><imgsrc="https://matrix.to/img/matrix-badge.svg"alt="Chat on Matrix"></a>
PyThaiNLP provides standard NLP functions for Thai, for example part-of-speech tagging, linguistic unit segmentation (syllable, word, or sentence). Some of these functions are also available via command-line interface.
40
+
PyThaiNLP provides standard NLP functions for Thai, for example part-of-speech tagging, linguistic unit segmentation (syllable, word, or sentence). Some of these functions are also available via the command-line interface.
41
41
42
42
<details>
43
43
<summary>List of Features</summary>
@@ -48,11 +48,11 @@ PyThaiNLP provides standard NLP functions for Thai, for example part-of-speech t
48
48
- Thai spelling suggestion and correction (`spell` and `correct`)
49
49
- Thai transliteration (`transliterate`)
50
50
- Thai soundex (`soundex`) with three engines (`lk82`, `udom83`, `metasound`)
51
-
- Thai collation (sort by dictionary order) (`collate`)
51
+
- Thai collation (sorted by dictionary order) (`collate`)
52
52
- Read out number to Thai words (`bahttext`, `num_to_thaiword`)
For dependency details, look at `extras` variable in [`setup.py`](https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py).
95
+
For dependency details, look at the `extras` variable in [`setup.py`](https://github.com/PyThaiNLP/pythainlp/blob/dev/setup.py).
96
96
97
97
98
-
## Data directory
98
+
## Data Directory
99
99
100
-
- Some additional data, like word lists and language models, may get automatically download during runtime.
100
+
- Some additional data, like word lists and language models, may be automatically downloaded during runtime.
101
101
- PyThaiNLP caches these data under the directory `~/pythainlp-data` by default.
102
-
-Data directory can be changed by specifying the environment variable `PYTHAINLP_DATA_DIR`.
102
+
-The data directory can be changed by specifying the environment variable `PYTHAINLP_DATA_DIR`.
103
103
- See the data catalog (`db.json`) at https://github.com/PyThaiNLP/pythainlp-corpus
104
104
105
105
106
106
## Command-Line Interface
107
107
108
-
Some of PyThaiNLP functionalities can be used at command line, using`thainlp` command.
108
+
Some of PyThaiNLP functionalities can be used via command line with the`thainlp` command.
109
109
110
-
For example, displaying a catalog of datasets:
110
+
For example, to display a catalog of datasets:
111
111
```sh
112
112
thainlp data catalog
113
113
```
114
114
115
-
Showing how to use:
115
+
To show how to use:
116
116
```sh
117
117
thainlp help
118
118
```
@@ -122,16 +122,16 @@ thainlp help
122
122
123
123
|| License |
124
124
|:---|:----|
125
-
| PyThaiNLP Source Code and Notebooks|[Apache Software License 2.0](https://github.com/PyThaiNLP/pythainlp/blob/dev/LICENSE)|
125
+
| PyThaiNLP source codes and notebooks|[Apache Software License 2.0](https://github.com/PyThaiNLP/pythainlp/blob/dev/LICENSE)|
126
126
| Corpora, datasets, and documentations created by PyThaiNLP |[Creative Commons Zero 1.0 Universal Public Domain Dedication License (CC0)](https://creativecommons.org/publicdomain/zero/1.0/)|
127
127
| Language models created by PyThaiNLP |[Creative Commons Attribution 4.0 International Public License (CC-by)](https://creativecommons.org/licenses/by/4.0/)|
128
-
| Other corpora and models that may included with PyThaiNLP | See [Corpus License](https://github.com/PyThaiNLP/pythainlp/blob/dev/pythainlp/corpus/corpus_license.md)|
128
+
| Other corpora and models that may be included in PyThaiNLP | See [Corpus License](https://github.com/PyThaiNLP/pythainlp/blob/dev/pythainlp/corpus/corpus_license.md)|
129
129
130
130
131
131
## Contribute to PyThaiNLP
132
132
133
-
- Please do fork and create a pull request :)
134
-
- For style guide and other information, including references to algorithms we use, please refer to our [contributing](https://github.com/PyThaiNLP/pythainlp/blob/dev/CONTRIBUTING.md) page.
133
+
- Please fork and create a pull request :)
134
+
- For style guides and other information, including references to algorithms we use, please refer to our [contributing](https://github.com/PyThaiNLP/pythainlp/blob/dev/CONTRIBUTING.md) page.
135
135
136
136
## Who uses PyThaiNLP?
137
137
@@ -140,13 +140,13 @@ You can read [INTHEWILD.md](https://github.com/PyThaiNLP/pythainlp/blob/dev/INTH
140
140
141
141
## Citations
142
142
143
-
If you use `PyThaiNLP` in your project or publication, please cite the library as follows
143
+
If you use `PyThaiNLP` in your project or publication, please cite the library as follows:
144
144
145
145
```
146
146
Wannaphong Phatthiyaphaibun, Korakot Chaovavanich, Charin Polpanumas, Arthit Suriyawongkul, Lalita Lowphansirikul, & Pattarawat Chormai. (2016, Jun 27). PyThaiNLP: Thai Natural Language Processing in Python. Zenodo. http://doi.org/10.5281/zenodo.3519354
147
147
```
148
148
149
-
or BibTeX entry:
149
+
or by BibTeX entry:
150
150
151
151
```bib
152
152
@misc{pythainlp,
@@ -166,7 +166,7 @@ or BibTeX entry:
166
166
| Logo | Description |
167
167
| --- | ----------- |
168
168
|[](https://airesearch.in.th/)| Since 2019, our contributors Korakot Chaovavanich and Lalita Lowphansirikul have been supported by [VISTEC-depa Thailand Artificial Intelligence Research Institute](https://airesearch.in.th/). |
169
-
|[](https://www.macstadium.com)| We get support free Mac Mini M1 from [MacStadium](https://www.macstadium.com) for doing Build CI. |
169
+
|[](https://www.macstadium.com)| We get support of free Mac Mini M1 from [MacStadium](https://www.macstadium.com) for running CI builds. |
170
170
171
171
------
172
172
@@ -181,5 +181,5 @@ or BibTeX entry:
181
181
</div>
182
182
183
183
<divalign="center">
184
-
<strong>Beware of malware if you use code from mirrors other than the official two at GitHub and GitLab.</strong>
184
+
<strong>Beware of malware if you use codes from mirrors other than the official two on GitHub and GitLab.</strong>
0 commit comments