#picolisp on 2019-01-14 — irc logs at freenode.irclog.whitequark.org

2018-09-14 18:41 ChanServ changed the topic of #picolisp to: PicoLisp language | Channel Log: https://irclog.whitequark.org/picolisp/ | Check also http://www.picolisp.com for more information

00:18 ubLIX has joined #picolisp

00:29 xkapastel has quit [Quit: Connection closed for inactivity]

01:04 shpx has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

01:06 shpx has joined #picolisp

01:10 shpx has quit [Client Quit]

01:20 ubLIX has quit [Quit: ubLIX]

01:48 viaken has quit [Quit: fuck you]

02:03 viaken has joined #picolisp

02:03 <viaken> Sorry about that. Forgot my quit message would go to everyone.

02:05 andyjpb has quit [Ping timeout: 272 seconds]

02:41 <rick42> viaken: lol

05:02 shpx has joined #picolisp

05:20 _whitelogger has joined #picolisp

05:24 shpx has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

05:24 orivej has quit [Ping timeout: 258 seconds]

05:27 shpx has joined #picolisp

05:29 orivej has joined #picolisp

05:43 shpx has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

05:45 shpx has joined #picolisp

05:53 shpx has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

06:51 rob_w has joined #picolisp

07:57 orivej has quit [Ping timeout: 268 seconds]

07:59 orivej has joined #picolisp

09:09 <Nistur> mornin'

09:26 alexshendi has quit [Ping timeout: 258 seconds]

09:51 <Regenaxer> Hi Nistur

10:01 aw- has quit [Ping timeout: 268 seconds]

10:04 <beneroth> Good morning Nistur

10:04 <beneroth> Good morning Regenaxer :)

10:05 <Nistur> o/

10:11 <Regenaxer> Hi beneroth

10:27 aw- has joined #picolisp

10:35 longshi has joined #picolisp

12:20 andyjpb has joined #picolisp

12:24 longshi has quit [Ping timeout: 252 seconds]

13:23 shpx has joined #picolisp

13:24 orivej has quit [Ping timeout: 244 seconds]

13:26 orivej has joined #picolisp

13:56 _whitelogger has joined #picolisp

14:19 orivej has quit [Ping timeout: 240 seconds]

14:35 shpx has quit [Quit: Textual IRC Client: www.textualapp.com]

14:45 longshi has joined #picolisp

14:57 ubLIX has joined #picolisp

14:58 longshi has quit [Ping timeout: 252 seconds]

15:15 <andyjpb> Is anyone going to FOSDEM this year?

15:19 * beneroth is not going to FOSDEM this year.

15:29 longshi has joined #picolisp

15:39 rob_w has quit [Remote host closed the connection]

15:43 <tankf33der> next task

15:43 <tankf33der> https://adventofcode.com/2018/day/13

17:05 freemint has joined #picolisp

17:12 <freemint> Regenaxer: back to puzzling with database again

17:13 <freemint> I am currently using {x} where x is a number. I noticed that {9} Bad input '9'

17:18 <freemint> I am trying to parse db created with pool. I am trying to parse block0 but i do not know where << starts

17:20 orivej has joined #picolisp

17:26 xkapastel has joined #picolisp

17:27 alexshendi has joined #picolisp

17:28 <alexshendi> Good evening!

17:34 freemint has quit [Ping timeout: 256 seconds]

17:39 <Regenaxer> Hi alexshendi!

17:40 <alexshendi> Hi Regenaxer, how is life?

17:40 <Regenaxer> Good :)

17:40 <Regenaxer> Nothing new

17:43 freemint has joined #picolisp

17:44 <freemint> Regenaxer: how to i parse a the block size out of a db file?

17:45 ubLIX has quit [Quit: ubLIX]

17:45 <Regenaxer> The '<<' byte is a shift count

17:45 <Regenaxer> 64 shift left by that number

17:45 <Regenaxer> {8} is illegal, must be an octal number

17:45 <freemint> Ok,but how do i get << out of the 0block.

17:46 <freemint> does the 0 block have a constant size independent of file block size?

17:46 <Regenaxer> no

17:46 <Regenaxer> see doc64/structures

17:46 <freemint> how large is a box in structures?

17:46 <Regenaxer> it is byte-oriented

17:47 <Regenaxer> Why do you need that info?

17:47 <Regenaxer> I mean, it is known a priori

17:48 <Regenaxer> Anyway, BLK is 6

17:48 <Regenaxer> in doc64/structures

17:49 <freemint> so the 13th byte is the shift factor?

17:49 <Regenaxer> I think the 12th

17:50 <Regenaxer> Free and Next are both 6 bytes

17:51 <freemint> so >> is part of next

17:52 <freemint> (in the same 6 bytes as next)

17:52 <Regenaxer> no

17:52 <Regenaxer> nope

17:52 <Regenaxer> offset 12 is the shift count

17:52 <Regenaxer> Historically size was always 64

17:52 <Regenaxer> so this byte was zero

17:52 <freemint> Do you start at 0 or 1 when counting bytes?

17:53 <Regenaxer> offset 12

17:53 <Regenaxer> Free and Next are both 6 bytes

17:54 <freemint> Yes

17:54 <Regenaxer> Is your block size scale factor not known?

17:54 <Regenaxer> 'dbs'

17:54 <freemint> (dbs) isNIL

17:55 <Regenaxer> Normally you don't need to parse that field

17:55 <Regenaxer> No!

17:55 <Regenaxer> The app calls (dbs ...

17:55 <Regenaxer> see the examples

17:55 <Regenaxer> app/er.l

17:55 <Regenaxer> any er.l

17:55 <freemint> ? (pool "dbtst") -> T ? (dbs) -> NIL

17:56 <Regenaxer> family.l:

17:56 <Regenaxer> (dbs

17:56 <Regenaxer> (0) # (1 . 64)

17:56 <Regenaxer> (2 +Person) # (2 . 256)

17:56 <Regenaxer> (3 (+Person nm)) # (3 . 512)

17:56 <Regenaxer> (3 (+Person job dat fin)) ) # (4 . 512)

17:56 <Regenaxer> 'dbs' *sets* it

17:56 <Regenaxer> not returns it

17:56 <Regenaxer> (doc 'dbs)

17:56 <freemint> So calling dbs is not an optimization but a necessity for the pilDB to work?

17:57 <Regenaxer> It is not necessary, but then you have a single file with factor 2

17:57 <Regenaxer> (like (dbs (2))

17:58 <Regenaxer> 'dbs' modifies the relations, and sets *Dbs

17:58 <Regenaxer> which is then passed to 'pool'

17:59 <rick42> hello peeps (don't let me interrupt you though :)

17:59 <Regenaxer> So it is for tuning the DB

17:59 <freemint> ok and after the byte which contains 2 starts the first block of a size of (<< 64 2)

17:59 <Regenaxer> Hi rick42!

17:59 <rick42> Hi Regenaxer and freemint!

17:59 <freemint> Hi rick42

17:59 <freemint> Hi alexshendi

17:59 <Regenaxer> No, all blocks are same size, also the first

18:00 <Regenaxer> So the first data block starts at 256

18:00 <Regenaxer> if factor is 2

18:00 <rick42> ah alexshendi! hello!

18:00 <freemint> or 64 if the factor is 0?

18:00 <Regenaxer> yep

18:00 <Regenaxer> This factor is *used* only upon file creation

18:01 <Regenaxer> Later *Dbs is ignored

18:01 <Regenaxer> and the factor in the root block is used

18:01 <Regenaxer> *Dbs is used only then to know how many files to open

18:01 <freemint> so the factor in the root block says how to read the file regardless of *Dbs

18:01 <Regenaxer> the sizes are taken from the files

18:02 <Regenaxer> yes

18:02 <rick42> metadata nice

18:02 <Regenaxer> The size cannot be changed

18:02 <Regenaxer> after the file was created

18:02 <freemint> (not without rebuilding the file or not at all?)

18:03 <Regenaxer> You need to 'dump' the objects, delete the file, and import

18:04 <Regenaxer> May be impossible if the new block size is smaller

18:04 <Regenaxer> well, no, possible

18:04 <Regenaxer> but leads to more fragmentation

18:04 <freemint> I just followed the same train of thought

18:05 <Regenaxer> @lib/db32-64.l in fact does something like that

18:05 <Regenaxer> I used it to port old 64-only DBs to the new system

18:06 <Regenaxer> Single-file DBs with 64 blocks

18:06 <Regenaxer> 64-byte blocks

18:07 <Regenaxer> oh, no

18:07 <Regenaxer> this is porting 32-bit DB to pil64

18:07 <Regenaxer> But similar

18:08 <freemint> whatwas the reason to allow flexible blocksize?

18:09 <Regenaxer> Efficiency

18:09 <Regenaxer> Large objects may take too many blocks

18:09 <Regenaxer> possibly fragmented

18:09 <freemint> have you seen a lot of fragmentation?

18:10 <Regenaxer> I never measured

18:10 <Regenaxer> But it surely will

18:10 <freemint> have you meassured performance improvements?

18:10 <Regenaxer> Not exactly

18:11 <Regenaxer> Imagine a B~tree node of 4 KiB

18:11 <Regenaxer> or 1 K

18:11 <freemint> i understand the theory

18:11 <Regenaxer> it would be 16 blocks

18:12 <Regenaxer> possibly spread over the full file size

18:12 <Regenaxer> means 16 seeks and 16 reads

18:12 <Regenaxer> more for larger symbols

18:12 <Regenaxer> so a flexible size is a *must*

18:13 <freemint> It is just that when i do stuff like that you cry "premature optimazation" ;)

18:13 <Regenaxer> no, it is different

18:13 <Regenaxer> I know the limits

18:13 <Regenaxer> used single file DB for 10 years

18:14 <freemint> hey das war ein Witz

18:14 <Regenaxer> and hit limits for huge DBs

18:14 <Regenaxer> yeah :)

18:14 <freemint> anyway. I now got a better understand how the DB works at the header level.

18:14 <Regenaxer> So it was not premature, but too slow for the Smapper projects

18:15 <Regenaxer> ok

18:15 <freemint> all so it cleared up my confusion about *Dbs and why it is not set on import

18:15 <Regenaxer> Also very important is to have critical indexes in their private file

18:16 <Regenaxer> Which import do you mean?

18:17 <freemint> on pool

18:17 <freemint> Can i try to summarize?

18:17 <Regenaxer> So "open"

18:17 <Regenaxer> yes

18:19 <freemint> Oh a question appeared what was NEXT and FREE good for. I Think next is the offset of the root cell

18:20 <Regenaxer> Next is the next free block, ie. the end of the file

18:20 <Regenaxer> Free is the start of the free list

18:20 <Regenaxer> ie deleted blocks

18:21 <Regenaxer> Next is redundant for normal files, but you can also use d /dev/ directly (without filesystem), so there is no EOF

18:21 <freemint> So Next is rougly the size of the file (number of blocks) and free is a pointer to the next free cell

18:22 <Regenaxer> yes, to the *first* free cell perhaps

18:22 <Regenaxer> a linked list of free blocks

18:22 <freemint> what to the first free cell perhaps?

18:22 <Regenaxer> How do you mean that?

18:23 <Regenaxer> the "what"

18:23 <freemint> Free is the index of the first free cell or Next is the index of the first free cell?

18:23 <Regenaxer> blocks, not cells here

18:23 <Regenaxer> Free is the avail list

18:24 <freemint> s/cells/blocks

18:24 <Regenaxer> Next is size of the file divided by blocksize

18:24 <freemint> thanks

18:24 <freemint> that makes perfect sense when explained that way

18:25 <Regenaxer> all these "pointers" must be shifted by the scale factor

18:25 <freemint> i am getting something wrong or can a file only contain 2^5 blocks?

18:26 <Regenaxer> no

18:26 <Regenaxer> 2**42 (4 Tera) Blocks per file

18:26 <freemint> 2^(6*8-1)

18:26 <Regenaxer> 2**16 Files -> 256 Peta objects

18:26 <Regenaxer> T

18:26 <freemint> i tihnk it is only 2^41

18:27 <Regenaxer> It is 42

18:28 <Regenaxer> 6 * 7

18:28 <freemint> i made a mistake

18:29 <freemint> question why only 6*(8-1) and not (6*8)-1 ?

18:29 <Regenaxer> It is 6 bytes (48), but 6 bits reserved

18:29 <freemint> ahh

18:29 <freemint> i thought only the LSB was always 0

18:30 longshi has quit [Ping timeout: 252 seconds]

18:30 <freemint> +-------------+-+-------------+-+----+

18:30 <freemint> Block 0: | Free 0| Next 0| << |

18:30 <freemint> +-------------+-+-------------+-+----+

18:31 <freemint> suggest that only the last bit is reserved in free

18:31 <Regenaxer> I dont remember new

18:31 <freemint> well that limit in unlikely to be hit.

18:31 <Regenaxer> The lowest 6 bits in a pointer are reserved

18:32 <Regenaxer> marker for first block, and following

18:32 <Regenaxer> eg see "ID-Block:"

18:33 <Regenaxer> It result from the non-shifted min of 64 bytes

18:33 <freemint> summary time

18:38 <freemint> A PicoLisp DB file contains a header inside it's first block, since the header is always smaller than 64 bits (the smallest possible blocksize) there is no problem.

18:38 <Regenaxer> right

18:43 <freemint> The header contains several flags, the offset of the next free block (which in turn points to the next free block, creating a list of blocks to recycle which maybe fragmented), and the current block count in the allocated space for the database (stored in next).

18:44 <freemint> and the block size.

18:44 <Regenaxer> correct

18:44 <Regenaxer> "block count" is more clear than "next"

18:46 Regenaxer has left #picolisp [#picolisp]

18:46 Regenaxer has joined #picolisp

18:46 <Regenaxer> oops :)

18:46 <freemint> The layout of the header is as follows: Free spans 6 bytes, but 6 bit of that are reserved giving a pilDB file a total adress space of 2^42.

18:46 <Regenaxer> right

18:46 <Regenaxer> the 'n' in

18:46 <Regenaxer> EXT-Block: | Link n| Data

18:47 <Regenaxer> is max 63

18:47 <Regenaxer> so if a symbol has more blocks, they all have 63

18:47 <Regenaxer> The 'n' is used only by 'dbck'

18:47 <Regenaxer> consistency check

18:48 <freemint> The block count (refered as NEXT in the documentation) is also 6 bytes with 6 bit reserved for flags. Again 2^42 blocks

18:48 <Regenaxer> If it is 0, it is used in several places

18:48 <Regenaxer> yes

18:48 <Regenaxer> 0 is used by 'seq' to find the next ID block

18:49 <Regenaxer> Skipping EXT-Blocks

18:49 <Regenaxer> So it mostly checks for zero or non-zero

18:53 <freemint> The next byte (byte 12 when starting to count with 0) encodes the blocksize. It is used a shift factor which shifts the smallest possible blocksize (64 bit) to the left. If a value of 2 (default value) is picked the blocksize 256 bytes. It is rare to see values larger than 7.

18:54 <Regenaxer> Correct

18:54 <Regenaxer> even 6 is seldom

18:54 <Regenaxer> I never used 7

18:54 <Regenaxer> but who knows?

18:54 <freemint> Is there any information in the header i did not document (other than the flags?)

18:55 <Regenaxer> No, that's all

18:55 <freemint> with 7 you get bigger than most hard disk sectors

18:55 <Regenaxer> 8192 bytes

18:55 <Regenaxer> very small hard disk

18:56 <Regenaxer> ah, "sector"

18:56 <freemint> I mean https://en.wikipedia.org/wiki/Disk_sector not disk size

18:57 <Regenaxer> yes

18:57 <Regenaxer> the sectors are pretty irrelevant, not even known

18:57 <Regenaxer> it reads full tracks

18:57 <Regenaxer> and caches them

18:57 <Regenaxer> Logical sector size is 512 still (?)

18:58 <Regenaxer> Unix blocksize is 8192 usually

18:58 <Regenaxer> buffer size used by stdio etc.

18:59 <freemint> But these disk sectors are only relevant to performance and have only a minor impact

19:00 <Regenaxer> Not even relevant to performance any more I suspect

19:00 <Regenaxer> The sectors are completely tranparent as I see it

19:00 <freemint> Opposed to the block size, which can result in a lot of jumping when to small (slowing performance a lot) or in a lot of uselessly transfared 0 and increased file size and more ram use

19:00 <freemint> when to big

19:01 <Regenaxer> Yep

19:02 <freemint> Since you want refer records of different sizes in a database and these are stored in different files. I am curious how pilDB refers objects in ther databases. I suspect ext is involved

19:03 <Regenaxer> 'ext' is for other DBs (not the one opened by 'pool')

19:04 <freemint> ohh

19:06 <freemint> A PicoLisp db maybe either a file with a single blocksize or a folder of different db files (with differing or the same blocks size) and a folder for Blobs (Binary objects not stored in the db).

19:06 <Regenaxer> correct

19:08 <freemint> So how can i refer from a large object in file A to a small one in B

19:09 <Regenaxer> The size does not matter. It is simply in the data

19:09 <Regenaxer> Typically a +Link or +Joint

19:09 <Regenaxer> An external symbol is a first class object

19:09 <freemint> My problem is that with the offset we can only point to location in the current file.

19:10 <Regenaxer> no, it encodes both file AND block

19:10 <Regenaxer> {A7}

19:10 <freemint> ahh so refering to other databases does not happen on the storage level but at the content level.

19:10 <Regenaxer> A = 1 (hax notation), so file 1 (starting tith '@' = zero)

19:11 <Regenaxer> No idea about storage or content. The symbol itself encodes its location

19:12 <Regenaxer> See line 96 in structures

19:12 <Regenaxer> xx.xxxxxxxxx.xxxxxxx.xxxxxxxxxxx.xxxxxxx.xxxxxxxxxxxxxxxxxxxE010

19:12 <Regenaxer> obj file obj file obj

19:12 <Regenaxer> the x are the bits

19:12 <Regenaxer> so file and offet are intermixed

19:13 <Regenaxer> interleaved

19:14 <Regenaxer> the reason for this encoding is size

19:15 <freemint> you are to quick for me here. Now that we have blocks, how do we store information in them?

19:15 <Regenaxer> Using PLIO

19:16 <Regenaxer> Moment, brb

19:16 Regenaxer has left #picolisp [#picolisp]

19:16 Regenaxer has joined #picolisp

19:16 <Regenaxer> ret

19:17 <Regenaxer> The blocks are only used to store the PLIO

19:17 <freemint> That is the same PLIO format used by (rd) and 'pr which is refered to as "encoded binary format" in the docs?

19:17 <Regenaxer> The point is how the symbols interact in the heap

19:17 <Regenaxer> yes

19:19 <Regenaxer> So the block in the files are only used to fetch the data, and write back modifiaions (persistence)

19:19 <freemint> "The point is how the symbols interact in the heap" Do you want to say that there is no magic translation layer. just that you jumoto a certain offset rd pilIO from there put it into address space (heap) of the pucolisp programm and the picolisp progamm is responsible what it makes out of it?

19:19 <Regenaxer> The program logic itself uses normal symbols

19:20 <Regenaxer> Even simpler

19:20 <freemint> oh tell me how it can be even simpler?

19:20 <Regenaxer> The "certain offset" is used only to fetch the data

19:21 <freemint> Yes

19:21 <Regenaxer> after that you have *normal* symbols

19:21 <Regenaxer> no idea of file and block

19:21 <freemint> or a number?

19:21 <freemint> or a cell?

19:21 <Regenaxer> only symbols

19:21 <freemint> ok

19:21 <Regenaxer> the val or prop of such a symbol can contain anything

19:22 <Regenaxer> numbers, lists, other (also external) syms

19:22 <freemint> except nil as key in the property list ;)

19:22 <Regenaxer> right

19:22 <Regenaxer> And *not* certain graph structures ;)

19:23 <freemint> \me tries looks like he is not the perpetrator

19:24 <Regenaxer> hehe, no

19:25 <freemint> So the pilIO in a database is only symbols "containing" lists, numbers and other symbols?

19:25 <Regenaxer> right

19:26 <Regenaxer> It stores a single value and one propert list in a single block(-list)

19:27 <freemint> as a symbols do in picolisp (except NIL, T)

19:28 <Regenaxer> yes

19:28 <Regenaxer> NIL and T *may* have properties, just the value is protected

19:29 <freemint> I played around with 'pr and used 'hd to inspect the resulting file. I tried to serialize a symbol with 'pr but it did not work out can you take a look?

19:30 <freemint> : (setq Sym 'Value)

19:30 <freemint> -> Value : (out "sym" (pr Sym)) -> Value

19:30 <Regenaxer> 'pr' does not serialize a complete symbol

19:30 <freemint> : (hd "sym")

19:30 <Regenaxer> only prints an expression

19:31 <freemint> Ahhh that explains my problem

19:31 <freemint> how do serialize a symbol then

19:31 <freemint> so pr is only for lists and numbers?

19:31 <Regenaxer> you could do (out "sym" (pr (val Sym) (getl Sym)))

19:32 <Regenaxer> no, also internal, external, transient symbols

19:32 <Regenaxer> (out "a" (pr 1 "a" 'b '(c 7 d)))

19:33 <Regenaxer> (out "a" (pr *DB '{A7}))

19:33 <freemint> : (out "sym" (pr (val Sym) (getl Sym)))

19:33 <freemint> -> NIL

19:33 <freemint> : (hd "sym")

19:34 <freemint> 00000000 00 00 ..

19:34 <freemint> -> NIL

19:34 <freemint> : Sym

19:34 <freemint> -> Value

19:34 <freemint> something went wrong

19:34 <Regenaxer> (setq Value (1 2 3)) (put 'Value 'a 1)

19:35 <Regenaxer> then (out ...

19:35 <Regenaxer> Or directly (out "sym" (pr Value (getl 'Value)))

19:35 <freemint> Ah Value it self is a symbol without return value and p-list

19:36 <Regenaxer> Symbols do not have return values

19:36 <freemint> correct i ment value

19:37 <Regenaxer> (out "sym" (pr (val Sym) (getl Sym))) evaluates 'Sym' in both cases

19:37 <freemint> So it is (out "sym" (pr (val 'Sym) (getl 'Sym))) to serialize Sym

19:37 <Regenaxer> so it uses 'Value'

19:37 <Regenaxer> It serializes whatever 'Sym' points to

19:37 <freemint> T

19:38 <freemint> So how do serialize 'Sym so if i uuse (rd) ona file that contains it, i get back my trusty old Sym

19:41 <Regenaxer> (in ... (setq Sym (rd)) (putl 'Sym (rd)))

19:41 <freemint> but then it looses its name, or i need to know the name they symbol had before i read it back

19:42 <Regenaxer> yes

19:42 <Regenaxer> (out "sym" (pr Sym (val Sym) (getl Sym)))

19:43 <Regenaxer> 'in' analog

19:43 <freemint> You mean (pr 'Sym ... )?

19:44 <Regenaxer> no

19:44 <Regenaxer> (pr Sym

19:44 <Regenaxer> (for Sym (all) (pr Sym (val Sym) (getl Sym))))

19:44 <Regenaxer> (in "file" (while (rd) (set @ (rd)) (putl @ (rd]

19:47 <freemint> Ok so there is no way to serialize a symbol as a "single thing" in PilIO i can only serialize it's components.

19:47 <Regenaxer> yes

19:47 <Regenaxer> It is not useful in general

19:47 <Regenaxer> externals are good

19:47 <Regenaxer> But how about transients?

19:48 <Regenaxer> You get new symbols upon read

19:48 <Regenaxer> And internal symbols are usually set in some source files

19:49 <freemint> i really need a better grasp at what a symbol is.

19:49 <freemint> So when reading back in it is a matter of convention. What convention did you choose for database?

19:49 <Regenaxer> Similar to the above. Value, then plist

19:49 <Regenaxer> as the name is implied anyway

19:50 <freemint> was there a reason why you wanted to that?

19:50 <Regenaxer> (for externals)

19:50 <Regenaxer> How do you mean that?

19:50 <Regenaxer> Value, then plist?

19:50 <freemint> Yes

19:50 <freemint> The name is applied by the {($file)($offset)} scheme?

19:51 <Regenaxer> right

19:51 <Regenaxer> and a symbol has max 3 compunets only

19:51 <Regenaxer> compo

19:51 <freemint> name, value and p-list?

19:51 <Regenaxer> yes

19:51 <Regenaxer> all three optional

19:51 <Regenaxer> well, value is always, but may be NIL

19:52 <Regenaxer> So what is a symbol? Good question

19:52 <Regenaxer> a place

19:52 <Regenaxer> a structured address

19:52 <freemint> A symbol is a place?

19:52 <Regenaxer> kind of

19:52 <freemint> and a reference to a place

19:53 <freemint> at the same time?

19:53 <Regenaxer> it has its unique address

19:53 <Regenaxer> a pointer

19:53 <Regenaxer> or reference

19:53 <Regenaxer> as you like

19:53 <Regenaxer> it is an address with properties

19:54 <freemint> reference is another word for pointer buty ou hint at the tag bits which say which type it is?

19:54 <Regenaxer> as value and name are in fact also just properties

19:54 <Regenaxer> only special for efficiency

19:54 <Regenaxer> yes, the type bits are needed because we have also numbers and lists in the same address space

19:55 <freemint> you confuse me. the value is stored in the address?

19:55 <Regenaxer> yes

19:55 <freemint> or is it stored at the address?

19:55 <Regenaxer> ah, *at*

19:55 <Regenaxer> physically

19:55 <freemint> so the value is in the place

19:55 <freemint> but they type info is in the address?

19:55 <Regenaxer> I thought the question was what a symbol is logically

19:56 <Regenaxer> yes, the type is known, as we have a symbol

19:56 <Regenaxer> so a symbol is a place to store properties

19:56 <freemint> are there different types of symbols which can be differentiated at the address level?

19:57 <freemint> that makes sense

19:57 <Regenaxer> The only "hard" type is in the pointer of an external symbol

19:57 <Regenaxer> The other three are depending on the context

19:58 <Regenaxer> A symbol may be intern in one namespace but transient in another

19:58 <freemint> what types of symbols were there?

19:58 <Regenaxer> Only an anonymous sym is never intern or transient

19:58 <Regenaxer> intern, extern, transient, anonymous

19:58 <Regenaxer> these 4

19:59 <Regenaxer> anonymous is a special case of transient

19:59 <freemint> ahh

19:59 <Regenaxer> without name at all

19:59 <Regenaxer> (new) or (box)

20:00 <freemint> are they used to construct objects in a OO programming? i ame would only be a hinderance

20:00 <freemint> *a name

20:00 <Regenaxer> yes

20:00 <freemint> other uses?

20:00 <Regenaxer> not really a hindrance

20:00 <Regenaxer> but someone would have to think up the names

20:01 <freemint> ans it would only space

20:01 <Regenaxer> Other uses are also non-OO cases

20:01 <freemint> like?

20:01 <Regenaxer> the space is used anyway

20:01 <Regenaxer> up to 7 chars are free ;)

20:01 <freemint> depends on how long the name is ;)

20:01 <Regenaxer> yes

20:02 <Regenaxer> Sometimes I make (box) without having a class etc

20:02 <Regenaxer> just to store some stuff

20:03 <Regenaxer> (let S (box 123) (put S 'a 1) (put S 's "abc) (doSomething with S))

20:03 <Regenaxer> not often though

20:05 freemint_ has joined #picolisp

20:06 freemint has quit [Ping timeout: 256 seconds]

20:06 <Regenaxer> like

20:06 <Regenaxer> : (pp 'expr)

20:06 <Regenaxer> (de expr ("F")

20:06 <Regenaxer> (set "F"

20:06 <Regenaxer> (list '@ (list 'pass (box (getd "F")))) ) )

20:06 <Regenaxer> -> expr

20:07 <Regenaxer> : (expr '+)

20:07 <Regenaxer> -> (@ (pass $177541625022177))

20:07 freemint_ has quit [Client Quit]

20:07 <Regenaxer> : +

20:07 <Regenaxer> -> (@ (pass $177541625022177))

20:07 freemint has joined #picolisp

20:09 <freemint> so when are two symbols the same, the name does not cut it since there are symbols which have no name

20:13 <freemint> i have played around and i noticed that there are two different "sames"

20:14 <Regenaxer> yes, '=' and '==' ?

20:14 <freemint> somethings are = but not ==

20:14 <Regenaxer> yes

20:15 <Regenaxer> '==' is exactly the same item

20:15 <Regenaxer> ie the same address

20:15 <Regenaxer> "pointer equality"

20:15 <freemint> so they have the same reference

20:16 <Regenaxer> T

20:16 longshi has joined #picolisp

20:16 <Regenaxer> Comparison with '==' is fast

20:16 <Regenaxer> '=' needs to traverse the structures

20:17 <Regenaxer> (name characters, or list elements)

20:17 <freemint> Addresses point at cells

20:17 <freemint> ?

20:17 orivej has quit [Ping timeout: 245 seconds]

20:18 <Regenaxer> yes

20:18 <Regenaxer> they point to the first, 4th or 8th byte of a cell (in pil64)

20:19 <Regenaxer> first is pair, 4th is bignum and 8th is symbol

20:19 <freemint> Why that?

20:19 <freemint> ah

20:19 <Regenaxer> so for a symbol it points to the value

20:19 <Regenaxer> (car Sym) is the same as (val Sym)

20:19 <freemint> Is it possible to have two "different" addresses point to the same cell? As one address says you find a number the other says you find a list?

20:20 <Regenaxer> It would never be the same address, because the tag bits modify it

20:21 <Regenaxer> Would be possible if the tag systematics would be different

20:21 <Regenaxer> (outside the actual pointer)

20:22 <freemint> so having two different of address to a single cell only causes chaos?

20:23 <Regenaxer> Not chaos, you can do that with 'adr'

20:23 <Regenaxer> Lets say it is a bit surprising

20:23 <freemint> is there any use in it?

20:23 <Regenaxer> and may crash easily

20:23 <Regenaxer> Some debugging

20:23 <Regenaxer> or brute force poking in the heap

20:24 <freemint> let me try to summarize about address and cells

20:24 <Regenaxer> good

20:25 <freemint> PicoLisp has a heap made out of cells where it stores all symbols, numbers and lists?

20:25 <Regenaxer> correct

20:26 <freemint> cells itself are not aware whether they are number or a symbol, you need an address containg the type get meaning out of a cell

20:26 <Regenaxer> exactly

20:26 <Regenaxer> It is in the eye of the observer

20:27 <Regenaxer> a cell just *is* ;)

20:27 <freemint> i always thought there were flags for types in the cell. Guess i was very mistaken.

20:27 <Regenaxer> yes, only the mark bits

20:27 <Regenaxer> gc and circ

20:28 <freemint> that reminds me of evil structures i built.

20:28 <Regenaxer> the graphs?

20:28 <freemint> yes

20:29 <freemint> anyway is there more to say about data storage in ram, than there are typeless cells and the type is in the pointer?

20:30 <freemint> Oh there is garberage collection ... but that is a topic for another time.

20:30 <Regenaxer> Perhaps that the heap is segmented in chunks

20:31 <Regenaxer> they are linked together

20:31 <Regenaxer> Each heap segment is 1 MiB

20:31 <freemint> i think that is not important at application level. it is a the GC level

20:31 <Regenaxer> yes

20:33 <freemint> tracking back we started out with the database now that i got a better understanding of cells can you explain again the "simple" way how we load {A7}.

20:34 <Regenaxer> the name has two parts

20:34 <Regenaxer> the first one is in "hax" notation, @ - O

20:34 <Regenaxer> so it is a hex number encoding the file

20:35 <Regenaxer> the other is in octal notation for the block number

20:35 <Regenaxer> pil32 user a syntax {123-45}

20:35 <freemint> a that is why i cam in to trouble when trying to get the {9}

20:35 <Regenaxer> yes

20:35 <Regenaxer> pil32 was inefficient

20:36 <Regenaxer> the name was really in ASCII

20:36 <Regenaxer> and needed a delimiter "-"

20:36 <Regenaxer> with hax/octal it is clear which part is which

20:37 <Regenaxer> and internally it is stored as interleaved bit patterns

20:37 <Regenaxer> so few files with few objects result in shorter names

20:37 <Regenaxer> and thus less space on disk

20:38 <Regenaxer> (in the heap the size is always exactly one cell)

20:38 <freemint> i still do not understand the roole of the interleaved bit pattern

20:38 <freemint> where do we use it?

20:39 <Regenaxer> the name is technically a number

20:39 <Regenaxer> a short num

20:39 <Regenaxer> using a variable size in PLIO

20:39 <freemint> so external symbols have a name

20:39 <Regenaxer> Do hd on (pr 12) vs (pr 123456789)

20:39 <Regenaxer> yes

20:39 <Regenaxer> it is the bit pattern

20:39 <freemint> is that name {A7} or the bitpattern of

20:40 <Regenaxer> in the TAIL part of the sym

20:40 <Regenaxer> A7 gives a small number

20:40 <Regenaxer> even smaller are objects in the first file

20:40 <Regenaxer> {7}

20:41 <freemint> so the bit pattern is used in the heap as address?

20:41 <Regenaxer> use only 2 bytes

20:41 <Regenaxer> not in the heap

20:41 <Regenaxer> only in PLIO

20:41 <Regenaxer> in the heap it is used to locate the block

20:41 <freemint> ah, that is how we refer to blocks in different files

20:41 <Regenaxer> for read and later for write

20:42 <Regenaxer> yes

20:42 <Regenaxer> always a file and a block offset

20:43 <Regenaxer> The application does not use this name

20:43 <Regenaxer> Only when printing it during debugging

20:43 <freemint> so when i want to point somewhere i have a symbol at a block offset which is encode value first the property list. and the value is an interleaved bit pattern?

20:44 <freemint> this is {A9}?

20:44 <Regenaxer> no, the name is the bit pattern

20:44 <Regenaxer> 9 is not legal octal

20:44 <freemint> sorry

20:44 <Regenaxer> Would be {A11)

20:45 <freemint> where is the name stored i though we only serialized value and p-list in to the blocks with pilIO

20:45 <Regenaxer> The name of *that* symbol does not need to be stored

20:46 <Regenaxer> but if the data *in* the symbol refer to other objects, it is stored there

20:46 <freemint> because it was implicit in the position?

20:46 <Regenaxer> yes

20:47 <Regenaxer> try (out "a" (pr '{A11})) (hd "a")

20:47 <freemint> You mean:but if the data *in* the symbol refer to other symbols, the name of the other symbol is stored there?

20:47 <Regenaxer> yes

20:47 <Regenaxer> encoded as external symbol in PLIO

20:48 <freemint> 00000000 0F 09 00 10 ....

20:48 <freemint> 0F is A

20:48 <freemint> 09 is 11 in octal

20:48 <Regenaxer> just as an internal symbol 'a' would be encoded as INTERN + "a"

20:48 <freemint> what was 00 01 for?

20:49 <Regenaxer> 0F is EXTERN

20:49 <freemint> INTERN being a some constant from pilIO right?

20:49 <Regenaxer> the lowest 2 bits of the first byte encode th type

20:50 <Regenaxer> then 3 for the length

20:50 <freemint> 05 is intern?

20:50 <Regenaxer> 3 << 2 | 3

20:50 <Regenaxer> I C: enum {NUMBER, INTERN, TRANSIENT, EXTERN};

20:50 <Regenaxer> so INTERN is 1

20:50 <Regenaxer> length << 2 | type

20:51 <freemint> : (out "a" (pr 's)) (hd "a") 00000000 05 73

20:51 <Regenaxer> 09 00 10 is the interleaved bit pattern

20:51 <Regenaxer> xx.xxxxxxxxx.xxxxxxx.xxxxxxxxxxx.xxxxxxx.xxxxxxxxxxxxxxxxxxxE010

20:51 <Regenaxer> obj file obj file obj

20:52 <freemint> E?

20:52 <Regenaxer> it is an "extern" bit

20:52 <Regenaxer> as I said

20:52 <Regenaxer> type for externals only is stored in the pointer

20:52 <Regenaxer> so the bit pattern is shifted right by 4

20:52 <Regenaxer> as any number

20:53 <Regenaxer> so the 'obj' part is xxxxxxxxxxxxxxxxxxx

20:53 <Regenaxer> put here the octal 11

20:53 <freemint> so iam looking at a 64 bit address s interleaved bit patterns are addresses?

20:54 <Regenaxer> not a 64 bit address

20:54 <Regenaxer> file and obj (block)

20:55 <Regenaxer> It is not human-readable :)

20:55 <Regenaxer> it is to save space

20:55 <freemint> i fear it is not human understandable

20:55 <Regenaxer> perhaps

20:55 <Regenaxer> I never think about it too

20:55 <freemint> Can you put an abstraction over it so i do not have to thin about it?

20:56 <Regenaxer> yes, it is {A11}

20:56 <Regenaxer> We can forget that A is 1 shifted left by 19 or so

20:58 <Regenaxer> xx.xxxxxxxxx.xxxxxxx.xxxxxxxxxxx.xxxxxxx.xxxxxxxxxxxxxxxxxxxE010

20:58 <Regenaxer> obj file obj file obj

20:58 <Regenaxer> ^6 ^5 ^4 ^3 ^2

20:58 <Regenaxer> The last line shows how many bytes are needed in PLIO

20:58 <freemint> ahh so when a symbol {B2} refers to {A11} then there is a block at b2, which when rd is {A11} ?

20:58 <Regenaxer> So A11 needs 3 bytes

20:59 <Regenaxer> hmm, no

20:59 <Regenaxer> when a symbol {B2} refers to {A11} then the data (some property) have {A11}

20:59 <Regenaxer> (show '{B2})

21:00 <freemint> data = value?

21:00 <Regenaxer> you see {A11} somewhere then

21:00 <freemint> mhh so it depends how i refer to ti

21:00 <Regenaxer> Or DB symbols the value usually holds the classes

21:00 <freemint> if i use classes it is in the p-list ofcourse

21:00 <Regenaxer> But in B-tree nodes *all* is in the value

21:01 <Regenaxer> yes

21:01 <freemint> ah and if we have really many files or objects we need 4 or 5 bytes?

21:01 <Regenaxer> exactly

21:01 <freemint> Regenaxer: that is so the value does not need to be skipped?

21:01 <freemint> (B tree)

21:01 <Regenaxer> Where would it be skipped?

21:02 <freemint> if B trees used the p-list instead of the value

21:02 <freemint> and we serialize the value first, as per convention

21:03 <freemint> and the value is useless, we would need to skip it

21:03 <freemint> which is more inefficient that it has to be

21:04 <freemint> is that the reason why the b tree uses the value?

21:11 Regenaxer has quit [Ping timeout: 268 seconds]

21:11 Regenaxer has joined #picolisp

21:12 <Regenaxer> The btree node needs no properties

21:12 <Regenaxer> it is a list structure

21:12 <freemint> ok

21:13 <Regenaxer> searched with 'rank'

21:13 <freemint> i came across something weird while playing around

21:13 <freemint> finish your thought

21:13 <Regenaxer> no, done

21:14 <freemint> ? (pool "g") -> T ? (set (print (new T)) 9) {2}-> 9 ? (commit) -> T ? (bye) joto@l148:~$ pil + : (pool "g") -> T : {2} -> NIL

21:14 <freemint> I tried to store a number in the value of {2}

21:14 <Regenaxer> it is not fetche this way

21:15 <Regenaxer> you need (val '{2})

21:15 <freemint> there is my9

21:15 <Regenaxer> yes

21:15 <freemint> i am happy

21:16 <Regenaxer> The lowest-level eval does not trigger fetching of the symbol

21:16 <freemint> I thought that if X is a symbol (= X (val X))

21:16 <Regenaxer> It would be very expensive

21:16 <Regenaxer> and never happens

21:16 <Regenaxer> as externals should never be directly in the code

21:17 <Regenaxer> except *DB

21:17 <freemint> It worked in when i was just using the heap. But it holds no longer true for DB, you are right about the not using {2} in code.

21:17 <Regenaxer> So you always have 'val' or 'get' or derived

21:18 <Regenaxer> yes, the name is not known

21:18 <Regenaxer> and *if* code helds such a symbol, it would not be gc'ed and fill up the heap

21:18 <freemint> I think it we got all the basic about the storage side of data in the DB and in the heap

21:19 <Regenaxer> on a single symbol potentially the whole DB may hang

21:19 <Regenaxer> OK, good :)

21:19 <freemint> or have missed anything?

21:19 <Regenaxer> So lets stop for today, I need some stuff to clean up

21:19 <Regenaxer> probably

21:19 <Regenaxer> but you can ask again

21:19 <Regenaxer> or investigate a little

21:20 <freemint> Another day it might be interesting to built an understanding of actual db usage with classes from scratch

21:20 <Regenaxer> ok, yes, the next layer

21:20 <freemint> There are still some question marks in my mind there.

21:21 <freemint> For example how does picoLisp know where the index starts when there is no explicit reference to the index starting external symbol in the code

21:22 <Regenaxer> it all hangs on the *DB value

21:22 <Regenaxer> on {1}

21:22 <Regenaxer> we saw last time

21:22 <freemint> so {1} has a property for the indexes in the DB?

21:22 <Regenaxer> Entities are properties in {1}

21:23 <freemint> i ask you what entities are next time

21:23 <Regenaxer> We saw last time with (edit *DB)

21:23 <Regenaxer> ok

21:23 <freemint> it was really enlightening

21:23 <Regenaxer> Great! :)

21:23 <Regenaxer> Have a good night!

21:24 <freemint> good night

21:24 <Regenaxer> I'm afp now

21:24 <Regenaxer> bye! :)

21:24 <freemint> You earned it

21:24 <freemint> beneroth aw- alexshendi razzy rick42 tankf33der and all others. Do you have thoughts on what we just did?

21:39 razzy has quit [Ping timeout: 250 seconds]

22:29 shpx has joined #picolisp

22:39 freemint has quit [Quit: Page closed]

23:37 shpx has quit [Quit: My MacBook has gone to sleep. ZZZzzz…]

23:47 shpx has joined #picolisp