
dsal3389/tokenizer


tokenizer

A simple, generic code tokenizer written in C. It does not target any specific programming language; it just groups the input into words and strings.

build

make build

example

# test.py

print("hello world")

def foo() -> int:
    return 0

Output of running tokenizer test.py -gs:

(NEWLINE)
print
(
"hello world"
)
(NEWLINE)
(NEWLINE)
(NEWLINE)
#
(SPACE)
comment
(NEWLINE)
def
(SPACE)
foo
(
)
(SPACE)
-
>
(SPACE)
int
:
(NEWLINE)
(SPACEx4)
return
(SPACE)
0
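
The grouping visible in the output above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual implementation: the names `tok_kind`, `struct token`, `next_token`, and `print_token` are hypothetical, and the code only mimics the output format shown above (words, quoted strings kept whole, runs of spaces, newlines, and single-character symbols). It makes no claim about what the `-gs` flag does internally.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical token kinds; these names are illustrative only. */
enum tok_kind { TOK_END, TOK_WORD, TOK_STRING, TOK_SPACES, TOK_NEWLINE, TOK_SYMBOL };

struct token {
    enum tok_kind kind;
    const char *start; /* points into the source buffer */
    size_t len;        /* number of bytes in the token */
};

/* Assumption: a "word" is a run of letters, digits, and underscores. */
static int is_word_char(unsigned char c)
{
    return isalnum(c) || c == '_';
}

/* Scan one token starting at *cursor and advance the cursor past it. */
struct token next_token(const char **cursor)
{
    const char *p;
    struct token t;

    assert(cursor != NULL && *cursor != NULL);
    p = *cursor;
    t.kind = TOK_END;
    t.start = p;
    t.len = 0;

    if (*p == '\0')
        return t;

    if (*p == '\n') {
        t.kind = TOK_NEWLINE;
        p++;
    } else if (*p == ' ') {
        /* Group a run of consecutive spaces into one token. */
        t.kind = TOK_SPACES;
        while (*p == ' ')
            p++;
    } else if (*p == '"') {
        /* Keep a double-quoted string, quotes included, as one token. */
        t.kind = TOK_STRING;
        p++;
        while (*p != '\0' && *p != '"') {
            if (*p == '\\' && p[1] != '\0')
                p++; /* skip the backslash; the escaped char follows */
            p++;
        }
        if (*p == '"')
            p++;
    } else if (is_word_char((unsigned char)*p)) {
        t.kind = TOK_WORD;
        while (is_word_char((unsigned char)*p))
            p++;
    } else {
        /* Anything else is emitted as a single-character symbol. */
        t.kind = TOK_SYMBOL;
        p++;
    }

    t.len = (size_t)(p - t.start);
    *cursor = p;
    return t;
}

/* Print one token per line, in the style of the output shown above. */
void print_token(const struct token *t, FILE *out)
{
    switch (t->kind) {
    case TOK_END:
        break;
    case TOK_NEWLINE:
        fputs("(NEWLINE)\n", out);
        break;
    case TOK_SPACES:
        if (t->len == 1)
            fputs("(SPACE)\n", out);
        else
            fprintf(out, "(SPACEx%zu)\n", t->len);
        break;
    default:
        fprintf(out, "%.*s\n", (int)t->len, t->start);
        break;
    }
}
```

A caller would point a cursor at the start of a NUL-terminated buffer, call `next_token` repeatedly until it returns `TOK_END`, and pass each token to `print_token`. Because tokens are slices of the original buffer, the sketch needs no allocation.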
