14 Jan 09:15

cedricrupb

v0.2.1

6797bcf

code_tokenize v0.2.1 Latest

This release is mainly for bug fixes and dependency updates. It should not change anything drastically.

28 Jun 11:07

cedricrupb

v0.2.0

2705388

code_tokenize v0.2.0

Major API redesign

As of v0.2.0, code_tokenize mainly uses the visitor pattern to process the AST.

Changes

tokenize now processes source code by parsing it into an AST and traversing that AST with a visitor
Custom tokenizing visitors can be defined per language
For Python, we corrected the tokenization process: indentation is now computed from the AST
Code is extensively tested by parsing large libraries (Python and Java)
More languages are more closely integrated
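
To illustrate the idea behind visitor-based tokenization, here is a minimal, self-contained sketch. It does not use code_tokenize's actual classes: `Node`, `TokenizingVisitor`, and `StringCollapsingVisitor` are hypothetical names invented for this example, and the tree is built by hand rather than produced by a parser.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """Toy AST node (hypothetical): leaves carry text, inner nodes carry children."""
    type: str
    text: str = ""
    children: list = field(default_factory=list)


class TokenizingVisitor:
    """Walks the tree depth-first and collects the text of every leaf."""

    def __init__(self):
        self.tokens = []

    def visit(self, node):
        # Dispatch to visit_<type> if a subclass defines it, else use the default.
        handler = getattr(self, f"visit_{node.type}", self.generic_visit)
        handler(node)
        return self.tokens

    def generic_visit(self, node):
        if not node.children:
            self.tokens.append(node.text)
        for child in node.children:
            self.visit(child)


class StringCollapsingVisitor(TokenizingVisitor):
    """Hypothetical language-specific visitor: emits a string literal as one token."""

    def visit_string(self, node):
        self.tokens.append("".join(child.text for child in node.children))


# Hand-built tree for the expression: print("hi")
tree = Node("call", children=[
    Node("identifier", "print"),
    Node("(", "("),
    Node("string", children=[Node('"', '"'), Node("content", "hi"), Node('"', '"')]),
    Node(")", ")"),
])

default_tokens = TokenizingVisitor().visit(tree)
custom_tokens = StringCollapsingVisitor().visit(tree)
```

The default visitor yields one token per leaf, while the subclass overrides a single node type. This is roughly what defining a custom visitor per language could look like.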

19 Jan 18:53

cedricrupb

v0.1.0

37ec10a

code_tokenize v0.1.0

First main release of code.tokenize

First version to extend the functionality of the underlying AST parser.

Changes

tokenize now parses source code with language-specific configuration
For Python, we automatically detect indentation and add special tokens
Code is now extensively tested by parsing large libraries (Python and Java)
Updated documentation to make usage easier
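
As a rough illustration of what "special tokens" for indentation can look like, the sketch below uses Python's standard `tokenize` module, which already reports `INDENT`, `DEDENT`, and `NEWLINE` events. This is not how code_tokenize implements it; the marker strings and the function name `tokens_with_indentation` are assumptions made for this example.

```python
import io
import tokenize

# Hypothetical marker strings for structural tokens.
MARKERS = {
    tokenize.INDENT: "<INDENT>",
    tokenize.DEDENT: "<DEDENT>",
    tokenize.NEWLINE: "<NEWLINE>",
}

# Token kinds that carry no content for this sketch.
SKIPPED = (tokenize.NL, tokenize.COMMENT, tokenize.ENDMARKER)


def tokens_with_indentation(source):
    """Return token strings, replacing layout tokens with explicit markers."""
    result = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type in MARKERS:
            result.append(MARKERS[tok.type])
        elif tok.type not in SKIPPED:
            result.append(tok.string)
    return result


example_tokens = tokens_with_indentation("if x:\n    y = 1\nz = 2\n")
```

Making indentation explicit keeps the block structure of Python code recoverable from a flat token sequence.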

Minor features (still under test)

AST-path-based detection of token types (variable usages, definitions, or function calls)
Language-specific configuration for Java
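
The idea of deriving a token's type from its position in the AST can be sketched with Python's standard `ast` module. This is only an analogy under stated assumptions: code_tokenize works on Tree-Sitter trees, and the function `classify_names` and the labels `"definition"`, `"usage"`, and `"call"` are hypothetical names chosen for this example.

```python
import ast


def classify_names(source):
    """Label each identifier by looking at its ancestors in the AST."""
    results = []

    def walk(node, path):
        if isinstance(node, ast.Name):
            parent = path[-1] if path else None
            if isinstance(parent, ast.Call) and parent.func is node:
                kind = "call"        # the name is the callee of a call expression
            elif isinstance(node.ctx, ast.Store):
                kind = "definition"  # the name is being assigned to
            else:
                kind = "usage"       # the name is being read
            results.append((node.id, kind))
        for child in ast.iter_child_nodes(node):
            walk(child, path + [node])

    walk(ast.parse(source), [])
    return results


labels = classify_names("x = 1\nprint(x)\n")
```

Here the same identifier `x` receives different labels depending on where it sits in the tree, which is the kind of distinction the feature above describes.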

01 Nov 16:14

cedricrupb

v0.0.1

afe6e62

code_tokenize v0.0.1

The first version of code(dot)tokenize.
This version introduces the following features:

Introduction of the Token API
AST-backed tokenization: The token interface enables easy access to the complete AST structure
Fast AST parsing backend based on Tree-Sitter
Full support of Tree-Sitter: Currently, all languages that are supported by Tree-Sitter can be tokenized
Auto loading: The parser definition for each language is automatically downloaded

Minor features (still under test):

Convention-based statement head identification (the starting token of a statement)
Convention-based statement splitting

Releases: cedricrupb/code_tokenize