<kc8apf>
those are technically part of the INTL tile but their meaning is part of the adjacent CLBLM
<kc8apf>
so the bits are physically in the INTL (and their addresses in the bitstream reflect that)
<rqou>
oh, i already encountered that
<rqou>
that's not the problem i'm having right now
<rqou>
new investigation seems to suggest there are 7(?!) up/down wires?
<daveshah>
kc8apf: that reminds me of the ice40 ultraplus, where one of the 8 DSPs swaps 2 config bits with an adjacent IPConnect tile
<rqou>
ok, wat
<rqou>
i have a particular set of mux bits
<rqou>
and in rows 1 and 4 a certain bit pattern selects a certain wire
<rqou>
and in rows 2/3 it does a different wire
<rqou>
but these wires are all logically named "I0"
<rqou>
ping azonenberg
<azonenberg>
ack
<rqou>
i'm seeing some behavior that totally doesn't make sense to me
<rqou>
i'm pretty sure tile inputs work the way i hypothesized last time
<rqou>
namely 3x 4-to-1 one-hot muxes sharing control bits followed by a 4-to-1 one-hot mux
<rqou>
so 8 control bits total, 13 unique inputs
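rqou's hypothesis above can be sketched as a toy model. The structure and indexing here are my guesses from the description (three level-1 muxes sharing one 4-bit one-hot field, and a level-2 mux whose fourth position is a direct input), not anything Altera documents:

```python
# Model of the hypothesized two-level one-hot mux for a tile input.
# Level 1: three 4-to-1 one-hot muxes sharing a single 4-bit one-hot field.
# Level 2: one 4-to-1 one-hot mux choosing among the three level-1 outputs
# plus one extra direct input -> 8 control bits, 3*4 + 1 = 13 unique inputs.

def one_hot_index(bits):
    """Return the index of the single set bit, or None if not one-hot."""
    if sum(bits) != 1:
        return None
    return bits.index(1)

def tile_input_mux(inputs, level1_bits, level2_bits):
    """inputs: 13 wires; inputs[0:12] feed level 1, inputs[12] is direct."""
    assert len(inputs) == 13 and len(level1_bits) == 4 and len(level2_bits) == 4
    s1 = one_hot_index(level1_bits)
    s2 = one_hot_index(level2_bits)
    if s2 == 3:                       # guessed: last position bypasses to the direct input
        return inputs[12]
    level1_out = [inputs[4 * m + s1] for m in range(3)]
    return level1_out[s2]

# Example: level-1 position 2, level-2 mux 1 -> input 4*1 + 2 = 6
wires = list(range(13))
assert tile_input_mux(wires, [0, 0, 1, 0], [0, 1, 0, 0]) == 6
```

This reproduces the bit count in the hypothesis (8 control bits, 13 unique inputs); which physical wire lands on each of the 13 positions is exactly the open question in the rest of the discussion.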
<rqou>
anyways, so i tried to fuzz e.g. the mux that controls LAB track number 0
<rqou>
and for a certain pair of set bits (the "same" input), as i move up/down the column
<rqou>
the _logical_ index of the source wire is the same
<rqou>
but the _physical_ location of which mux bits are getting set to drive that source wire don't make any sense
<rqou>
any guesses as to wtf is going on?
<azonenberg>
first guess: tiles are not arranged the same way logically and physically
<azonenberg>
Say, a tile is 2x as high as it is wide physically but is logically square
<rqou>
um, that definitely does not seem to be the problem
<rqou>
the LUT bits all show up in the expected place
<azonenberg>
next possibility, tiles are in groups of (say) 2x2 and mirrored?
<azonenberg>
or possibly just interconnect?
<rqou>
not in any consistent way
<rqou>
so e.g. for LAB line 0
<rqou>
i have a pattern
<rqou>
0111
<rqou>
1110
<rqou>
this seems to select "a right-going wire two tiles to the left"
<rqou>
but which right-going wire?
<rqou>
in rows 1 and 4 it selects the third one
<rqou>
in rows 2 and 3 it selects the first one
<rqou>
but i have a different pattern
<rqou>
1011
<rqou>
1110
<rqou>
this selects "a right-going wire in this tile"
<rqou>
but in row 2 it selects the first one and in all other rows it selects the third one
<rqou>
but ALL OF THESE HAVE INDEX 0?!
<azonenberg>
??
<rqou>
i know, right?
<azonenberg>
talk about non-orthogonality
<rqou>
so wtf do you think is happening?
<rqou>
also, afaict there are _definitely_ wires originating out of the io tiles
<rqou>
but their coordinates are not in the io tile
<rqou>
the coordinates get forced into some other tile
<rqou>
also also, i still cannot get the numbers to add up to the routing resources that quartus will report
<rqou>
maybe the report is bogus?!
<rqou>
azonenberg: so, afaict based on the bits
<rqou>
each tile has 8 left wires, 8 right wires, 7 up wires, and 7 down wires
<rqou>
io tiles have an unknown number of wires
<azonenberg>
The bits don't lie
<azonenberg>
reports are useful but should not be considered absolute truth
<rqou>
hmm, the "direct links" count in the report seems busted too
<rqou>
it claims 888, which depending on how you interpret it is either way too many or way too few
<rqou>
waaaait a sec
<rqou>
the "direct links" number is more plausible if you use the restricted LE count
<rqou>
um... could the R4/C4 numbers be like that too?
<azonenberg>
i guess?
<rqou>
what
<rqou>
no, it reports the same number for the real 240LE
<rqou>
i think i'm going to start ignoring the report; it makes no sense
<rqou>
btw 8 l/r and 7 u/d is consistent with the "routing channel width" numbers that we had disregarded earlier
<rqou>
so azonenberg, what next?
<rqou>
i'm pretty stuck on "wtf is this crazy coordinates/numbering scheme"
<rqou>
but in general the mux patterns kinda make sense?
<rqou>
it's just not clear exactly which wires go into them
<rqou>
azonenberg: what would you focus on fuzzing next?
<rqou>
i'm pretty stuck right now on the coordinates issue
<rqou>
but it seems to be blocking getting a deeper understanding?
<rqou>
ok, the column bits seem to be much more consistent
<rqou>
it seems like the row wires are just shuffled somehow
<rqou>
hmm, based on the data i have here i wonder if N3/N8 neighbor connections are several ps slower than the others
<rqou>
since it seems to be one more mux level
<rqou>
so afaict only the row wire numbering is fucked
<rqou>
the column wire numbering seems to make some amount of sense?
<rqou>
i'm pretty sure i've marked all of the control bits correctly except at the edges
<rqou>
so i think the next step will be to fix my own 2d coordinates
<rqou>
and then mark the bits that are involved in IO cell muxes (not necessarily decoding them yet)
<rqou>
and then actually try to decode mux values
<rqou>
which should be much easier once i know what bits control what rather than going in blind
<q3k>
shadowdancer: i know nothing about the ps4..?
<q3k>
not sure where you got that impression from
<rqou>
huh, I've been pinged multiple times now on GitHub regarding Rust and SVD/device support crates
<rqou>
i should probably allocate some time to deal with it
<awygle>
is it possible to run svd2rust as part of a build script? so that you don't have to include a device-specific crate every time?
<rqou>
there was a comment somewhere that japaric doesn't like that idea
<rqou>
but yes, of course it's possible
<rqou>
since build scripts can run arbitrary subprocesses
<cr1901_modern>
You have svd crates?
<rqou>
(expecting whitequark to jump in at any moment now and call these "typical rqou hacks")
<rqou>
cr1901_modern: i was trying to maintain some unified stm32/efm32 crates earlier
<rqou>
but this requires a ton of effort that i haven't fully invested into it
<rqou>
and overall the ecosystem for this kinda sucks
<cr1901_modern>
I like the core idea of the structs svd2rust generates, even if svd files are of varying quality (*cough* NXP)
<rqou>
a bunch of people have ideas that i disagree with, so I should probably find some time to jump into the conversation that they've pinged me on
<rqou>
also, overall I've been finding japaric's rtfm framework itself pretty great, but embedded-hal seems pretty unusable
<rqou>
which is disappointing because i love the idea of embedded-hal
<rqou>
it just doesn't seem to actually be very usable
<rqou>
also japaric has been going around fucking everything up recently so I'm still stuck on a two-month-old nightly until i have time to make everything work again
<awygle>
rqou: can you point me at this discussion? i'm at least an interested observer
<rqou>
awygle: i like the essence of that idea, but that post has a bunch of "extraneous" comments that make me wary
<rqou>
awygle: specifically the "only support 7 modules (that covers 99% of Adafruit stuff)" comment
<awygle>
yeah lol
<rqou>
this is often a red flag for me that this will become a useless Ardui-noob api that isn't actually powerful enough for real use
<awygle>
I don't actually agree even with the proposal as written but at least that person seems to be looking at slicing the right way
<rqou>
(embedded-hal _already_ has this problem)
<awygle>
I don't even really know why SVD is so prevalent in the discussion
<rqou>
yeah I'm not really tied to the idea of svd
<awygle>
I really like the way chibios is arranged
<rqou>
see for example my svd fragments that have to run through the c preprocessor first
<rqou>
hmm i should look into that
<rqou>
I've seen several people recommend it
<rqou>
in general though my opinion is that i hate HALs
<awygle>
A chip as a collection of peripheral drivers, a board as a mapping from pins to peripherals (in short, glossing over a lot)
<rqou>
i just don't get the point of "board" abstraction
<awygle>
Well, your opinion is wrong :-P HALs are hugely useful for all those cases where you're not pushing the envelope, as long as they're reasonably sane
<awygle>
You can always beat a hal, but that's not the point
<awygle>
I'm not super married to a board level abstraction but somebody somewhere has to know what pins go to what, and it's nice if that's all in one place for purposes of porting
<cr1901_modern>
Also, rqou/awygle, you both idle in #rust-embedded. Why not discuss embedded rust in there where you could actually get help?
<rqou>
because there's never activity there
<cr1901_modern>
ppl will get back to you, you just need to be patient
<awygle>
because I'm not actually trying to do anything. I just enjoy shooting the breeze. I'm not writing any kind of embedded code, currently.
<awygle>
If I needed help I'd go there
<rqou>
in my (one, so not very representative) attempt to contact japaric it didn't go particularly well
<cr1901_modern>
If you DM him he will eventually get back to you. I understand OT in #openfpga, but doing so for embedded rust seems incredibly redundant (when not everyone in here uses Rust like that in the first place, if at all).
<rqou>
well, my one attempt to contact japaric went like this: "plz 2 comment out this one line of code. it doesn't break anything and fixes cortex-m0. <crickets, time passes> oh, i fixed it in the latest release (which also broke a whole bunch of other stuff)"
<awygle>
cr1901_modern: my concern is that if I complain, idly and without research, in #rust-embedded, it will sound like I'm asking for a change. That will either burn social capital with the community as they explain all the ways my uninformed complaints are uninformed, or cause a bunch of people to do a bunch of work based on my idle musings. The first is bad for me, the second bad for others. Complaining here is safe.
<awygle>
When/if I actually want to engage the community, I'll do a lot more homework.
<cr1901_modern>
awygle: Social capital? You're one of the reasons msp430 works in the first place! :P
<rqou>
in general i find "large" communities not really worth the effort to interact with
<awygle>
cr1901_modern: well, yes :) but that was at a substantially lower level of the stack (and i have a pile of TODOs in that area that realistically i won't get back to for maybe as much as a year)
<Ultrasauce>
to throw a little more on the OT pile, today a technical rep from a major camera vendor told me to reverse engineer their product to avoid having to go through the nda/partnership/knowledge transfer process
<Ultrasauce>
I am a little weirded out!
<Bike>
that sounds pretty shady.
<shapr>
yikes
<balrog>
Ultrasauce: haaaah
<balrog>
because they refuse to provide reasonable documentation?
<awygle>
wow seriously?
<awygle>
i can't decide if that's horrifying or awesome. probably both.
<Bike>
If they have one of those "if you use this you can't RE it" agreements couldn't they be huge assholes and pursue you?
<Ultrasauce>
it certainly does feel like a potential trap, not that I'd ascribe any explicit ill intent to the suggestion
<Bike>
yeah, they probably wouldn't actually do that, but an advantage of going through the legal gibberish is that they can't change their minds
<azonenberg_work>
awygle: So i re-ran the numbers for the mac table w/ more significant digits
<azonenberg_work>
If we have minimum length packets at full line rate on all interfaces
<azonenberg_work>
We have a max of 95.23 Mpps
<azonenberg_work>
Which means we need to average 1.64 clocks per packet if the MAC table is running at 156.25 MHz
<azonenberg_work>
that's more margin than i thought, but i still want to try and pipeline it to do one lookup per clock
<azonenberg_work>
That would allow me to process 156.25 Mpps, or 107.9 Gbps, of min-sized packets without blocking in the mac table
<azonenberg_work>
Still only ~half the performance I need for LATENTORANGE though, i will probably have to do a dual-panel table and/or upclock to 312.5 MHz for that
<azonenberg_work>
(targeting 280 Gbps max throughput there)
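The quoted packet-rate figures fall out of the arithmetic if you assume LATENTRED's 24x1G + 4x10G interfaces at line rate and count each minimum frame as 64 bytes plus 8 bytes of preamble and 12 bytes of inter-frame gap on the wire (these framing assumptions are mine; azonenberg may be counting slightly differently):

```python
# Back-of-the-envelope check of the MAC table numbers, assuming
# 24x1G + 4x10G at line rate and 64B frames + 8B preamble + 12B IFG.
GBPS = 1e9
aggregate_bps = 24 * 1 * GBPS + 4 * 10 * GBPS      # 64 Gbps total
wire_bits_per_min_frame = (64 + 8 + 12) * 8        # 672 bits per min frame
max_pps = aggregate_bps / wire_bits_per_min_frame  # ~95.24 Mpps
clk_hz = 156.25e6
clocks_per_packet = clk_hz / max_pps               # ~1.64 clocks/packet

print(round(max_pps / 1e6, 2))      # 95.24
print(round(clocks_per_packet, 2))  # 1.64
```

Up to rounding this matches the 95.23 Mpps and 1.64 clocks-per-packet figures above.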
<q3k>
i wonder if that's something commercial switches actually handle well
<q3k>
i wouldn't be surprised if they just start flooding
<azonenberg_work>
Don't know
<azonenberg_work>
As long as you never send >1 Gbps per gig port and >10 Gbps per 10G port, LATENTRED should not drop anything or flood ever
<azonenberg_work>
If you have bursts of faster data, the buffers will cover it up to a point
<azonenberg_work>
in particular, the 72 Mb of QDR-II+ can handle up to 7.2 ms of full line rate 10G traffic going to a single 1G interface
<azonenberg_work>
before filling up
<azonenberg_work>
at which point it'll start to drop 90% of the traffic
<azonenberg_work>
Actually the 7.2 ms assumes i'm not emptying the buffer as i fill it
<azonenberg_work>
So actually i think it comes out to 8 ms
<azonenberg_work>
in any case, that is kind of an unavoidable problem if you are rate-matching interfaces, all you can do is make the buffer bigger but dropping is inevitable in that situation
<azonenberg_work>
Not something i can fix architecturally
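The two buffering numbers above check out under the stated assumptions (72 Mb of QDR-II+, a full-rate 10 Gbps flow destined for a single 1 Gbps port):

```python
# Time until the 72 Mb QDR-II+ buffer fills with 10G traffic headed
# to one 1G port, with and without draining the buffer while filling.
buffer_bits = 72e6
fill_bps = 10e9
drain_bps = 1e9

t_no_drain = buffer_bits / fill_bps                  # ignore the 1G drain
t_with_drain = buffer_bits / (fill_bps - drain_bps)  # net fill rate 9 Gbps

print(t_no_drain * 1e3)    # 7.2 ms
print(t_with_drain * 1e3)  # 8.0 ms
```

Which is exactly the 7.2 ms vs 8 ms correction azonenberg makes: accounting for the 1 Gbps drain stretches the time-to-full from 7.2 ms to 8 ms.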
<q3k>
i'm still disgusted by how expensive commercial 10GbE switches are
<q3k>
fucking broadcom monopoly
<azonenberg_work>
lol
<azonenberg_work>
how expensive are you complaining about?
<q3k>
well, expensive for a hackerspace/home lab
<azonenberg_work>
give me a number
<q3k>
i think the arista I want is around $2k
<q3k>
second-hand
<azonenberg_work>
So, the BOM cost for LATENTRED right now (incomplete, for example i dont have all of the passives for the brain board yet)
<azonenberg_work>
is about 1.2k in components alone
<azonenberg_work>
For single unit volume
<q3k>
sure
<azonenberg_work>
then several hundred in PCBs
<q3k>
i don't mind paying that for low-volume hardware
<azonenberg_work>
and then the custom 1U case
<q3k>
i mind paying that for mass-produced second-hand hardware
<azonenberg_work>
also keep in mind that LATENTRED is 24x 1G / 4x 10G interfaces
<q3k>
i know
<q3k>
still worth it when it comes to experimental hw
<azonenberg_work>
The thing you linked is closer to LATENTORANGE, which will be tentatively 28 10G lanes, with a TBD mix of 10G and 40G ports
<q3k>
might end up going with a juniper qfx3500
<q3k>
but then I don't have access to firmware downloads
<q3k>
and need a license for bgp (!)
<q3k>
ugh
<rqou>
aaaaaargh i just spent ages hunting down a bizarro hardware bug
<rqou>
*firmware bug
<rqou>
turns out I got bit by store/reorder buffers
<rqou>
in a cortex *m*
<awygle>
arm's memory model is bonkers
<whitequark>
what
<awygle>
well okay. bonkers is not fair. but it's much looser wrt ordering than x86 or x86-64
<pie_>
single stack was a mistake
<rqou>
if you clear an interrupt pending flag too close to the end of the isr handler, the write can get buffered and cause the handler to get entered again
<q3k>
>arm's memory model is bonkers
* q3k
[laughs in MIPS]
<awygle>
mips is not in the list i'm looking at
<whitequark>
rqou: oh
<whitequark>
wow, I'll need to keep that in mind
<rqou>
yeah, so if your timers ever appear to be firing twice as quickly, this is one possible reason :P
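The race rqou describes can be sketched with a toy model. This deliberately simplifies real Cortex-M store-buffer and NVIC behavior (the names `ToyCore`, `dsb`, etc. are mine), but it captures why clearing the pending flag right before returning re-enters the handler unless a barrier drains the write first:

```python
# Toy model of the ISR re-entry race: the write clearing the pending
# flag sits in a store buffer, so on ISR return the "NVIC" still sees
# the flag set. A dsb() drains the buffer before returning.

class ToyCore:
    def __init__(self):
        self.pending = True      # peripheral interrupt pending flag
        self.store_buffer = []   # buffered writes, not yet visible
        self.isr_entries = 0

    def write_clear_pending(self):
        # The write is posted to the store buffer, not yet visible.
        self.store_buffer.append(("pending", False))

    def dsb(self):
        # Barrier: force all buffered writes to complete.
        for reg, val in self.store_buffer:
            setattr(self, reg, val)
        self.store_buffer = []

    def return_from_isr(self):
        # If the pending flag is still visible, the handler is re-entered.
        if self.pending:
            self.isr_entries += 1

def isr(core, use_dsb):
    core.isr_entries += 1
    core.write_clear_pending()   # clear the flag at the very end of the ISR
    if use_dsb:
        core.dsb()
    core.return_from_isr()

buggy, fixed = ToyCore(), ToyCore()
isr(buggy, use_dsb=False)
isr(fixed, use_dsb=True)
print(buggy.isr_entries)  # 2 -- handler taken twice
print(fixed.isr_entries)  # 1
```

In real firmware the equivalent fix is the `DSB` azonenberg mentions next: execute a data synchronization barrier after the flag-clearing store, before exiting the handler.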
<azonenberg_work>
rqou: did you not do a dsb before the end of the ISR?
<q3k>
i tend to always treat interrupts as possibly spurious in my systems code
<rqou>
no, you usually don't need one
<q3k>
is not pretty but protects you against weird races like that
<azonenberg_work>
q3k: and i prefer to not use interrupts and write event-driven code where hardware does all of the hard-realtime stuff and you just pop an event queue as you get around to things :P
<q3k>
well, you're not supposed to do hard work in ISRs anyway
<azonenberg_work>
??
<q3k>
drop that event into a queue, schedule a bottom half, run the bottom half when your system is idle
<azonenberg_work>
how do you normally handle things like "this has to be done within 5 clocks of pin X going high"
<q3k>
you wanna do that in software? :P
<azonenberg_work>
No :p
<whitequark>
you're absolutely supposed to do work in ISRs
<azonenberg_work>
Which is why i use FPGAs for almost all of my embedded work these days
<whitequark>
cortex-m-rtfm is built entirely around that
<azonenberg_work>
my point is, i prefer the event-driven model
<q3k>
whitequark: and I'll argue this is poor practice
<azonenberg_work>
so why have your CPU be interrupted at all?
<azonenberg_work>
Why not just design the architecture so the hardware puts events in the queue for you, then you just pop when idle?
<whitequark>
q3k: do you have a non-cargo-cult reason?
<azonenberg_work>
This is why in antikernel most of my CPUs don't even support interrupts
<q3k>
whitequark: starvation of non-interrupt-driven logic
<azonenberg_work>
It's much more deterministic this way
<whitequark>
q3k: you don't have to have any.
<whitequark>
for one.
<q3k>
whitequark: and oftentimes I end up having code have to synchronize data from multiple sources, so I prefer getting them accessible from a single thread safely as fast as possible
<whitequark>
and for another, if you do event-driven, you're just exchanging that for losing events *and* you can still starve other logic if event-driven logic takes too much
<whitequark>
that's a better reason
<q3k>
right, but with event driven it's much easier to apply backpressure on different parts of the system to limit starvation
<q3k>
and to actually measure your system load by different event types
<whitequark>
you can't apply backpressure to interrupts, just like you can't block in them...
<azonenberg_work>
q3k: in antikernel i planned to implement that by having ulimits per event source
<azonenberg_work>
For example, once the NIC has more than 32 pages allocated to it, future mallocs will fail with "you're using too much ram"
<azonenberg_work>
and ethernet frames will be dropped until the IP stack (whether SW or HW) catches up and frees some of the pages the NIC is using
<awygle>
i would have expected reti to act as a barrier, 'parently not
<awygle>
i usually try to avoid substantial work in interrupts, but i've mostly worked on systems where the main interrupts are "DMA complete" interrupts that just need to wake up a thread to deal with the new buffers
<rqou>
arm doesn't have reti
<rqou>
especially not in cortex-m
<rqou>
it's just a normal bx lr
<whitequark>
um, no
<whitequark>
the instruction encoding is normal but the return isn't
<whitequark>
you have a special value in lr
<rqou>
well yes
<whitequark>
so the core can do whatever
<whitequark>
for that matter, it *does* whatever
<rqou>
but it's not a separate opcode like x86
<rqou>
i guess `bx lr with a magic value` could have been made to be a barrier
<awygle>
huh. i've never written arm asm, but i googled "arm reti" before i said that and got results that implied to me it existed. is it an alias?
<awygle>
oh, no, i see
<awygle>
nvm, reading comprehension failure
<awygle>
wow, that seems kind of ugly actually, vis a vis the hardware
<whitequark>
awygle: I think the idea is that interrupt handlers are just C functions
<whitequark>
NVIC knows the C ABI, reusing bx lr was the last missing part
<awygle>
i can see it being useful for the software. kind of weird for hardware to be reading return addresses and doing atypical things though. seems like a big comparator for one thing.