rob_w has quit [Quit: Leaving]
orivej has quit [Read error: Connection reset by peer]
orivej has joined #picolisp
_whitelogger has joined #picolisp
_whitelogger has joined #picolisp
_whitelogger has joined #picolisp
orivej has quit [Ping timeout: 246 seconds]
orivej has joined #picolisp
orivej has quit [Ping timeout: 250 seconds]
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<
tankf33der>
morning
<
tankf33der>
checking utf8 4th byte on this file
<
tankf33der>
comparing with older versions of pil
<
tankf33der>
im not sure what i doing
<
Regenaxer>
Good morning tankf33der
<
Regenaxer>
The 4th byte is used for characters > 65536
<
Regenaxer>
eg smileys
<
tankf33der>
my idea if i read with older version i got corrupted output
<
tankf33der>
and last char is not 9
<
Regenaxer>
Yes, you get corrupted data
<
tankf33der>
i do: in file until eof prinl char
<
Regenaxer>
The file must contain such characters
<
tankf33der>
i dont see last 9 on older version
<
tankf33der>
i will continue
<
Regenaxer>
What is "last 9"?
<
tankf33der>
this link above from github
<
tankf33der>
raw file from repo
<
tankf33der>
last number is 9
<
Regenaxer>
(in "thisFile" (read))
<
Regenaxer>
or (in "thisFile" (line))
<
Regenaxer>
I can input such characters with Penti directly
<
Regenaxer>
so it now also works in the REPL
<
Regenaxer>
Also things like (in "thisFile" (chop (read)))
<
Regenaxer>
and (pack @) then
<
tankf33der>
also different
<
tankf33der>
seems ok
<
tankf33der>
two versions of pil on the same file
<
tankf33der>
older pil on left
<
Regenaxer>
Great :)
<
Regenaxer>
Thanks for testing tankf33der
rob_w has joined #picolisp
rob_w has quit [Changing host]
rob_w has joined #picolisp
<
alexshendi>
Merry Christmas (somewhat belatedly).
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<
Regenaxer>
alexshendi: Merry Christmas to you too!
<
tankf33der>
doing dummy tests for UTF8
<
tankf33der>
downloaded utf8 files with full range in it, sum char char should be exactly number
<
tankf33der>
already see how old pil breaks after 65k
<
tankf33der>
will test future releases if required.
<
Regenaxer>
Cool, very good
alexshendi has quit [Ping timeout: 268 seconds]
libertas has quit [Ping timeout: 272 seconds]
libertas has joined #picolisp
orivej has joined #picolisp
<
tankf33der>
idea failed. i need something different.
alexshendi has joined #picolisp
alexshendi has quit [Read error: Connection reset by peer]
alexshendi has joined #picolisp
<
Regenaxer>
Sum char char would give which number?
<
Regenaxer>
yes, but how do you know the right result?
<
tankf33der>
yea, implement on another language
<
tankf33der>
since i cant find good test file i should generate it first
<
Regenaxer>
difficult
<
Regenaxer>
iirc there are unused ranges
<
Regenaxer>
Better use existing unicode stuff
<
Regenaxer>
There are many tables
<
Regenaxer>
In txt format, with comments etc
<
Regenaxer>
in @lib/wide.l
<
Regenaxer>
s/in/for/
<
tankf33der>
i know old and new pil understand all this
<
Regenaxer>
yes, should be. There seems nothing above 65536
<
Regenaxer>
It is mostly emoticons which are normally used in that high range
<
tankf33der>
i can generate file with bytes in range 65000 1000000
<
tankf33der>
i know golang says it all valid
<
Regenaxer>
It is exactly the range in hex 010000 .. 10FFFF
<
Regenaxer>
Nothing above that
<
Regenaxer>
UTF-4 four-byte 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
orivej has quit [Ping timeout: 250 seconds]
alexshendi has quit [Ping timeout: 246 seconds]
orivej has joined #picolisp
orivej has quit [Ping timeout: 246 seconds]
alexshendi has joined #picolisp
orivej has joined #picolisp
_whitelogger has joined #picolisp
orivej has quit [Ping timeout: 244 seconds]
alexshendi has quit [Ping timeout: 244 seconds]
orivej has joined #picolisp
freemint has joined #picolisp
<
tankf33der>
python3 vs. picolisp sum char char difference is always 3
<
tankf33der>
big progress.
<
tankf33der>
i will continue tomorrow.
<
tankf33der>
i wrote code on python
<
tankf33der>
if do char char from beginning of file i got:
<
tankf33der>
python3:
<
tankf33der>
forget above.
<
freemint>
the problem is that (char) in picolisp and python can be differently big and you adjusted 'char so it works almost like pythons?
<
tankf33der>
this is correct output from python and picolisp
<
tankf33der>
this is how difference 3 appears !
<
tankf33der>
i will continue tomorrow.
<
Regenaxer>
hmm, so there is no difference?
<
tankf33der>
piclisp rules
<
Regenaxer>
Same results?
<
tankf33der>
sum char char is equal on pyrhon and picolisp
<
Regenaxer>
ok, the sum of all chars in the file
<
Regenaxer>
Sorry, I cant look at imgur jpegs, they crash my tablet
<
freemint>
it is a hex dumpof he file in question
<
Regenaxer>
firefox uses too much memory when opening an imgur page, so it kills my terminal on my tablet with only 2 GiB of RAM
<
Regenaxer>
So good to hear it seems correct
<
tankf33der>
and pil from jun2018 not equal to python
<
Regenaxer>
yeah, until now it supported only up to 3 bytes UTF-8
<
tankf33der>
and pil32 and pil64 dec2018 are equal to python
<
Regenaxer>
like original Java
<
tankf33der>
sleep.
_whitelogger has joined #picolisp
rob_w has quit [Read error: Connection reset by peer]
freemint has quit [Quit: Page closed]