ggteixeira/corpus-cleaner
44
Linguistic tool (made by a linguist, for linguists) that scraps corpora, automatically cleans it up, and generates n-grams.
What's novel
Linguistic tool (made by a linguist, for linguists) that scraps corpora, automatically cleans it up, and generates n-grams.
Score Breakdown
Innovation
3 (25%)
Craft
27 (35%)
Traction
11 (15%)
Scope
58 (25%)
Signal breakdown
Innovation
Not Fork+1
Code Novelty+0
Unique Niche+1
Concept Novelty+1
Craft
Ci-3
Tests-5
Polish+0
Releases-2
Has License+0
Code Quality+12
Readme Quality+8
Recent Activity+7
Structure Quality+5
Commit Consistency+0
Has Dependency Mgmt+5
Traction
Forks+0
Stars+6
Hn Points+0
Watchers+3
Early Traction+0
Devto Reactions+0
Community Contribs+2
Scope
Commits+5
Languages+3
Subsystems+5
Bloat Penalty+0
Completeness+6
Contributors+6
Authored Files+15
Readme Code Match+3
Architecture Depth+7
Implementation Depth+8
Evidence
Commits
10
Contributors
2
Files
150
Active weeks
1
TestsCI/CDREADMELicenseContributing
Repository
Language
Python
Stars
1
Forks
0
License
—