##openfpga on 2019-08-06 — irc logs at freenode.irclog.whitequark.org

00:04 emeb has quit [Quit: Leaving.]

00:23 _whitelogger has joined ##openfpga

00:31 emeb_mac has joined ##openfpga

01:00 cr1901_modern1 has quit [Quit: Leaving.]

01:00 cr1901_modern has joined ##openfpga

01:01 implr has quit [Ping timeout: 258 seconds]

01:10 <whitequark> mwk: ohhh hm, you're right

01:10 <whitequark> lemme fix it

01:14 <mwk> hrm wtf

01:14 <mwk> another "fun" thing

01:14 <mwk> if you completely fuck up loading the bitstream and, in fact, never upload any valid data

01:15 <mwk> JSTART will *still* start up the FPGA and light up DONE

01:15 <whitequark> but not ISC_DONE?

01:15 <mwk> ISC_DONE as well

01:15 <whitequark> i think i've seen that behavior, actually

01:15 <whitequark> hm

01:15 <whitequark> wait

01:15 <whitequark> are you sure this happens without a flash too?

01:15 <mwk> yes

01:16 <mwk> the jumper is set to JTAG-only

01:16 <mwk> heh

01:16 <mwk> try it

01:16 <whitequark> i can't

01:16 <mwk> just comment out the CFG_IN

01:16 <whitequark> my board isn't jumpered

01:16 <whitequark> i'd have to like desolder the flash

01:16 <whitequark> or i guess short it or something

01:16 <mwk> doesn't matter

01:16 <whitequark> ok sure

01:17 <mwk> just JPROGRAM + JSTART

01:17 <mwk> behavior is consistent with blank bitstream (ie. all outputs tristated, DONE high)

01:18 flea86 has joined ##openfpga

01:19 <mwk> I suppose that's weird but harmless for a blank bitstream (or something missing the sync word entirely)

01:19 <mwk> not so harmless for a configuration that failed because of CRC / IDCODE error though

01:20 <mwk> but, ugh

01:20 <mwk> there doesn't seem to be a JTAG register you could query for error status

01:20 <mwk> so you'd actually have to talk to it via CFG_IN / CFG_OUT?

01:21 <mwk> that just keeps getting worse, doesn't it...

01:22 <mwk> okay, it doesn't start for a CRC error, good

01:23 <whitequark> mwk: i commented CFG_IN

01:23 <mwk> I suppose it works for blank because it technically never triggers an error

01:23 <whitequark> doesn't start

01:23 <mwk> no DONE?

01:23 <whitequark> nope

01:23 <mwk> could be different on xc6s

01:23 <whitequark> yes, might have been a bug they fixed

01:25 <whitequark> fixed applet

01:25 <whitequark> how should i credit you btw

01:26 <mwk> realname, I guess

01:28 <mwk> eh

01:29 <mwk> I guess I'll try messing with CFG_IN/CFG_OUT now

01:29 <mwk> and/or maybe figure out wtf this ISC_* thing is

01:29 <whitequark> sounds good. I'm wondering about the bit order in ISC_*

01:29 <whitequark> btw do you have a glasgow

01:30 <mwk> no, using the builtin basys2 programmer

01:30 <mwk> hmm

01:30 <mwk> what is ISC anyway? I suppose I should read IEEE 1532 first?

01:30 <whitequark> in system configuration

01:31 <mwk> it's some kind of redundant interface to the same thing we're using, made in the name of conforming to some standard?

01:31 <whitequark> in my understanding yes

01:32 <whitequark> scihu has a copy of 1532

01:32 <whitequark> btw, ieee says that 1532 is "withdrawn", not sure what's up with that

01:32 <whitequark> a long time ago too

01:32 <whitequark> someone didn't pay the ieee rans^Wfees?

01:34 <whitequark> something i kinda want to have in the applet is readback...

01:34 <mwk> readback is tricky

01:35 <mwk> you have to actually send CFG_IN queries and get CFG_OUT responses

01:35 Bike has quit [Ping timeout: 248 seconds]

01:35 <mwk> and... well, there's the thing I don't really understand about it

01:35 <mwk> in normal JTAG, you have registers with well-defined lengths

01:35 <whitequark> yep

01:36 <mwk> if you have multiple devices in the chain, you just concatenate your shit with some BYPASS bits in the front and the back, and done

01:36 <mwk> but if CFG_IN is equivalent to just connecting TDI to DIN

01:36 <mwk> and your FPGA is not the first device in the chain

01:37 <mwk> you'll always submit some crap bits to it beforehand, whether you want it or not, correct?

01:37 <whitequark> yep.

01:37 <mwk> and for CFG_OUT, you'll always trigger a few more bits of readback than you are actually interested in

01:37 <whitequark> yep.

01:37 <mwk> so... how on earth are you supposed to do readback

01:37 <whitequark> i suspect that's why the ISC interface exists.

01:37 <whitequark> since it doesn't have infinite length registers.

01:38 <mwk> if your carefully-constructed commands are getting padded with shit
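
A minimal sketch of the chain problem described above, assuming every other device in the chain is in BYPASS (a 1-bit data register that captures 0). The function names are made up for illustration and are not from any JTAG library. For a fixed-length register the padding is harmless; for a streaming register such as CFG_IN, the second function shows the extra leading bits that reach the target.

```python
from collections import deque


def pad_fixed_register(payload, devices_before, devices_after):
    """Bits to shift (in time order) to load a fixed-length register.

    devices_before: bypassed devices between TDI and the target.
    devices_after:  bypassed devices between the target and TDO.
    Bits shifted first travel furthest toward TDO, so the padding for the
    TDO-side devices goes first and the TDI-side padding goes last.
    """
    return [0] * devices_after + list(payload) + [0] * devices_before


def bits_seen_by_target(stream, devices_before):
    """Bits arriving at the target's data input for a streaming register.

    Each upstream BYPASS register delays TDI by one clock and starts at 0,
    so the target receives that many unwanted bits before the real stream.
    """
    bypass = deque([0] * devices_before)
    seen = []
    # Extra clocks at the end flush the tail of the stream into the target.
    for bit in list(stream) + [0] * devices_before:
        bypass.appendleft(bit)     # new bit enters the device nearest TDI
        seen.append(bypass.pop())  # oldest bit leaves toward the target
    return seen
```

With two bypassed devices in front, `bits_seen_by_target([1, 0, 1, 1], 2)` gives `[0, 0, 1, 0, 1, 1]`: the payload arrives intact but prefixed by two bits the sender never intended as data.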

01:38 <whitequark> also. take a look at what iMPACT does maybe?

01:38 <whitequark> it can generate SVF files for multiple device chains

01:39 <mwk> and of course this board has an XCFsomething flash chip in front of the FPGA, so I'm going to run into this issue sooner rather than later

01:40 <mwk> ugh impact

01:40 <mwk> does this mean I'll actually have to install X on this machine...

01:41 <mwk> alright, so the bitgen -g IEEE1532:Yes option is boring

01:41 <mwk> it just packages the exact same bitstream (byte-reversed, or rather 32-bit word-reversed) into an .isc file

01:42 Bike has joined ##openfpga

01:43 <whitequark> hm, 32-bit word-reversed

01:43 <mwk> well

01:43 <whitequark> what's the size of ISC_PDATA register on your device?

01:43 <mwk> .isc is some text wrapping, with a comma-separated array of 32-bit words written as hex

01:44 <whitequark> because on xc6s it's 16 bit

01:44 <whitequark> but on some others it's 32 bit

01:44 <mwk> oh

01:44 <mwk> that's supposed to be like that

01:44 <whitequark> i'm not sure

01:44 <whitequark> i've never looked at 1532

01:44 <mwk> they changed the base word size for the bitstream from 32 bits to 16 bits on the Spartan 3E -> Spartan 3A jump

01:45 <mwk> ie. xc3sa, xc3sda, xc6s have 16-bit words for everything, everything else has 32-bit words
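
The rule as stated here fits in a small lookup. This is only a sketch: the keys are the family shorthands used in this conversation, not full part numbers, and the function name is hypothetical.

```python
# Families that, per the discussion above, use 16-bit bitstream words.
SIXTEEN_BIT_FAMILIES = {"xc3sa", "xc3sda", "xc6s"}


def bitstream_word_bits(family):
    """Return the base bitstream word size in bits for a family shorthand."""
    return 16 if family.lower() in SIXTEEN_BIT_FAMILIES else 32
```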

01:45 <whitequark> yep sounds about right

01:45 <azonenberg> mwk, whitequark: have you queried the INIT bit for that situation?

01:46 <whitequark> mwk: wait. what

01:46 <mwk> azonenberg: what situation? the JSTART with blank bitstream?

01:46 <whitequark> you mean *pre* xc3sa and *post* xc6s devices all have 32 bit words?

01:46 <azonenberg> Yeah

01:46 <azonenberg> whitequark: correct, XC6S was the lone departure

01:46 <mwk> INIT_B is 1

01:46 <azonenberg> almost everything else is 32

01:46 <azonenberg> XC6S is xilinx's windows ME

01:46 <mwk> not lone, this and Spartan 3A

01:46 <mwk> and yes

01:46 <mwk> xc6s is *special*

01:46 <whitequark> lol windows me

01:47 <mwk> it *is* windows me

01:47 <azonenberg> i'm serious, it was the architecture that was so bad they stopped having two product lines

01:47 <whitequark> looool

01:47 <mwk> right

01:47 <azonenberg> they killed it, and made spartan7 be a cut down virtex

01:47 <whitequark> what was the problem with it

01:47 <mwk> it's the single most fucked up family they made

01:47 <mwk> oh gods, lots of them

01:47 <azonenberg> it's literally the exact same thing ms did with winME

01:48 <mwk> I mean, eh

01:48 <mwk> lots of little things really

01:48 <azonenberg> mwk: also see jtaghal XilinxSpartan6Device.h#86 or UG380 table 5-35

01:48 <azonenberg> for spartan6

01:48 <azonenberg> crc and idcode error are clearly called out as specific state bits

01:48 <mwk> first, the whole chip is just... irregular as fuck

01:48 <azonenberg> yeah that alone is a reason why i dont think we will see a f/oss toolchain for it any time soon

01:48 <azonenberg> it's massively more work than something like 7 series

01:49 <mwk> virtex 4+ have clean column-based architecture

01:49 <mwk> virtex 2 / spartan 3 also have some sanity

01:49 <azonenberg> s6 doesnt even have square arrays

01:49 <azonenberg> they have random cutouts in the corners for GTPs

01:49 <whitequark> huh

01:49 <mwk> but s6 has just shit sprinkled around everywhere

01:49 <azonenberg> you have spots where there's tiny little peninsulas like two or three CLBs wide

01:49 <azonenberg> and if you manage to shove any logic in there, good luck making timing

01:49 <mwk> yes

01:49 <mwk> lots of them

01:50 <azonenberg> http://thanatos.virtual.antikernel.net/temp/wtfplacement.png

01:50 <mwk> then there's the IO clocking clusterfuck

01:50 <azonenberg> Big cutout = GTP

01:50 <whitequark> azonenberg: 404

01:50 <azonenberg> smaller cutout below it = PCIe IP

01:50 <mwk> https://0x04.net/~mwk/leohtml/6slx45t.html

01:50 <azonenberg> http://thanatos.virtual.antikernel.net/unlisted/wtfplacement.png

01:50 <mwk> here's the geometry

01:50 <azonenberg> oops

01:50 <mwk> green == CLB, pink-ish == IO

01:51 <mwk> just have a look at left/right side

01:51 <mwk> the big pieces of shit at the top are GTPs, the slightly smaller one on the left is PCIE

01:52 <mwk> also note the skipped CLBs near the clock column

01:52 <mwk> the carry chain just kind of jumps over these

01:53 <azonenberg> mwk: btw, are you actually doing dev on s6 tools?

01:53 <whitequark> azonenberg: wow the GTP is *huge*

01:53 <azonenberg> whitequark: serdes ip in general is massive

01:53 <mwk> azonenberg: I'm wrapping up the fuzzers this week

01:54 <mwk> still fighting IO clocks

01:54 <azonenberg> mwk: I might have an Atlys i can part with for a steep discount on MSRP

01:54 <azonenberg> If you want one for testing

01:54 <mwk> does it do GTPs?

01:54 <whitequark> mwk: that pcie macro isn't small either

01:54 <azonenberg> No, it's a LX45 without the -T

01:55 <mwk> not of much use to me then, unfortunately

01:55 <mwk> already stocked up on non-t

01:56 <mwk> and as for other things wrong with spartan 6

01:56 <azonenberg> http://siliconpr0n.org/map/xilinx/xc7a35t/mz_mit20x/#x=3188&y=2860&z=2

01:56 <azonenberg> this is a 7a15t/35t/50t die

01:56 <mwk> there are just so many differences from other families

01:56 <azonenberg> GTP quad bottom left, you can see the change in power distribution mesh at the boundary

01:56 <mwk> the clocking setup is batshit

01:56 <azonenberg> i want dig to delayer it at some point (it's his specimen)

01:56 <mwk> the IO clocks in particular

01:57 <azonenberg> note the tx/rx diffpairs paired out and fanning out to bond pads

01:57 <mwk> [it's the one last area I'm struggling with fuzzing FWIW -- the IO clocks]

01:58 <mwk> I've read the clocking resources guide a few times by now, and still have no fucking clue what's going on

02:01 <mwk> it has documentation for attributes that don't actually exist in ISE

02:01 <mwk> and doesn't have documentation for some attributes that do

02:01 <azonenberg> lol

02:01 <azonenberg> hey, at least UG380 fixed some of the bugs in the configuration sequences i found when writing jtaghal

02:02 <azonenberg> there are references to a couple of webcases in the jtaghal source

02:02 <azonenberg> saying "datasheet is wrong, see case #1234, here's how it actually works"

02:02 <azonenberg> but the later doc revs fixed that

02:03 s_frit has quit [Remote host closed the connection]

02:03 s_frit has joined ##openfpga

02:05 <mwk> azonenberg: and yet, still buggy

02:06 <azonenberg> at least its not as bad as the greenpak stuff was

02:06 <mwk> you're only sending 64 RTIs in JSTART, right?

02:06 <azonenberg> (doc wise)

02:06 <azonenberg> check the source, i think so

02:06 <azonenberg> when i had time to work on greenpak stuff that is

02:06 <azonenberg> it got to the point that i had direct email access to the engineer who wrote the datasheets

02:06 <mwk> then you'll run into problems with bitstreams that take more time than that

02:06 <azonenberg> i was practically sending him diffs

02:06 <azonenberg> IIRC the docs said you only needed 16

02:06 <azonenberg> and i sent more to be safe

02:07 <_whitenotifier-3> [whitequark/libfx2] whitequark pushed 1 commit to master [+0/-0/±1] https://git.io/fjQId

02:07 <_whitenotifier-3> [whitequark/libfx2] whitequark 9d6146a - Work around a bug in SDCC 3.5.

02:07 <mwk> and that's bullshit

02:07 <mwk> you may need *a lot* of cycles

02:07 <mwk> if you use LCK_Cycle, the DCM can take quite a bit of time to lock

02:07 <mwk> what you really need to do is poll for EOS

02:07 <mwk> *and* keep sending JSTART RTIs to it

02:07 <azonenberg> I do a single check right now, not a full polling loop

02:08 <azonenberg> i guess i never used LCK_Cycle

02:08 <azonenberg> so never hit that

02:08 <azonenberg> if you want to file a ticket against jtaghal i can add a fix when time permits

02:08 <_whitenotifier-3> [libfx2] Success. The Travis CI build passed - https://travis-ci.org/whitequark/libfx2/builds/568164835?utm_source=github_status&utm_medium=notification

02:08 <mwk> otherwise the startup machine won't proceed once it's done with locking
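
A rough sketch of the polling loop suggested above: keep clocking Run-Test/Idle with JSTART loaded and check EOS between chunks, instead of sending a fixed number of clocks. `FakeTap`, `run_test_idle` and `read_eos` are hypothetical stand-ins so the example runs on its own; they are not the jtaghal or Glasgow API, and how EOS is actually read from the device is left abstract.

```python
class FakeTap:
    """Toy model of a device whose startup needs some number of clocks."""

    def __init__(self, clocks_until_eos):
        self.remaining = clocks_until_eos

    def run_test_idle(self, clocks):
        # The startup sequencer only advances while it is being clocked.
        self.remaining = max(0, self.remaining - clocks)

    def read_eos(self):
        return self.remaining == 0


def wait_for_startup(tap, chunk=64, max_clocks=1_000_000):
    """Clock the device in chunks until EOS is set; return clocks sent."""
    sent = 0
    while sent < max_clocks:
        tap.run_test_idle(chunk)
        sent += chunk
        if tap.read_eos():
            return sent
    raise TimeoutError(f"EOS not set after {sent} clocks")
```

A design that waits on a slow DCM lock simply takes more iterations here, whereas a single fixed burst of 64 clocks would leave it stuck mid-startup.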

02:10 <mwk> sigh

02:10 <azonenberg> Side note, who decided LCK_Cycle was a good idea?

02:10 <mwk> UG380 really is crap

02:10 <azonenberg> This is what BUFGCE's are for

02:10 <azonenberg> Just gate your output clocks until the PLL output is stable

02:10 <azonenberg> (and of course DCMs are a whole other can of worms vs real PLLs...

02:11 <mwk> oh, that's part of the fun of spartan 6

02:11 <mwk> do you want a real PLL, or a DCM? you got both!

02:11 <azonenberg> yeeeeah

02:11 <azonenberg> but iirc you can only use the DCMs for certain things?

02:11 <mwk> heck, you can even chain them

02:12 <azonenberg> there's some weird dedicated paths

02:12 <mwk> nah

02:12 <azonenberg> i cant recall specifics, but there were situations when using a pll wasnt an option iirc

02:12 <azonenberg> maybe i just ran out of them?

02:12 <whitequark> what's a DCM?

02:12 <mwk> PLLs can route anywhere the DCMs can, and to some places they cannot as well

02:12 <mwk> whitequark: digital clock manager

02:12 <mwk> kinda-but-not-really PLL

02:13 <azonenberg> whitequark: it's basically a PLL with a variable length ring oscillator instead of a VCO

02:13 <whitequark> yeah like

02:13 <whitequark> oh thanks.

02:13 <whitequark> is that... why would you do that?

02:13 <azonenberg> so it's a giant ptv-dependent jitter machine

02:13 <mwk> on s6, you have two choices

02:13 <whitequark> yes. why the fuck would anyone want that

02:13 <azonenberg> along the same lines, the s6 IODELAYs are uncalibrated

02:13 <whitequark> oh great, so your board heats up and it has to retrain DDR?

02:13 <azonenberg> so you cannot do static timing offsets, you have to experimentally figure out what gives the best BER

02:13 <whitequark> really?

02:14 <azonenberg> i mean you can do a static offset but it's PTV dependent

02:14 <azonenberg> meanwhile in 7 series, the delay lines are controlled by IIRC some kind of bias voltage generator and a feedback circuit

02:14 <azonenberg> that adjusts a reference delay line so... 32? taps equals one cycle of a provided refclk

02:14 <azonenberg> thus continually compensating for PTV

02:14 <mwk> DCM: takes a clock, deskews it wrt feedback clocks, has 9 outputs: 0/90/180/270 phase shift original clock, 0/180 phase shift original clock, original clock divided by a programmable integer (or integer and a half), and a 0/180 phase shift pair of a generated M/D clock

02:14 <azonenberg> this is how v6 works too

02:15 <azonenberg> s6 and s3 have DCMs because plls were expensive? :p

02:15 <mwk> and PLL: takes a clock, divides by D1, multiplies by M, and then has 6 outputs, each divided by independent D2

02:16 <mwk> + independent phase shifts for every output
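
The frequency arithmetic in these two descriptions can be written down directly. This is a sketch of the formulas as stated in the chat only; it does not check the legal ranges of M, D or the VCO frequency, and the function names are made up.

```python
from fractions import Fraction


def pll_output_freqs(f_in_hz, d1, m, d2_list):
    """PLL as described: divide by D1, multiply by M, then one D2 per output."""
    vco = Fraction(f_in_hz) / d1 * m
    return [vco / d2 for d2 in d2_list]


def dcm_fx_freq(f_in_hz, m, d):
    """DCM synthesized M/D clock: input frequency times M over D."""
    return Fraction(f_in_hz) * m / d
```

For example, a 100 MHz input with D1 = 1 and M = 8 gives an 800 MHz internal clock, so D2 values of 2, 4 and 8 yield 400, 200 and 100 MHz.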

02:16 <mwk> but there are twice as many DCMs as PLLs

02:16 <azonenberg> yeah, so it's helpful to use them if you can tolerate the downsides

02:16 <azonenberg> I don't miss leaving s6 one bit

02:16 <azonenberg> all of my projects are now vivado based and using SV on 7 series or ultrascale

02:17 <azonenberg> now i just need to find time to improve yosys sv support so prjxray will be able to work on my code...

02:17 <whitequark> please start by making always_ff actually do that

02:17 <whitequark> instead of being an alias for always

02:17 <mwk> whitequark: and oh yeah, delays are uncalibrated

02:17 <mwk> and so is on-chip termination

02:18 <azonenberg> oh yeah i forgot the termination was uncal too

02:18 <mwk> so you can select 50 Ohm impedance

02:18 <whitequark> that seems kind of a shit design, to put it bluntly

02:18 <azonenberg> OUT_TERM is just an alias for whatever drive strength is nominally kinda close to that

02:18 <mwk> and the data sheet says you may get 20 Ohm, or maybe 70 Ohm

02:18 <azonenberg> whitequark: as i've said many times before, i consider s6 to be NRND

02:19 <mwk> it kind of tries to make up for the "uncalibrated" part by having dynamically reconfigurable I/O though

02:19 <azonenberg> There is literally one thing it does better than 7 series, and that's low static power

02:19 <azonenberg> but it makes up for that by having much higher dynamic power

02:19 <mwk> and Xilinx memory controller IP does crazy shit with it to get DDR working

02:20 <whitequark> like what

02:20 <mwk> it basically does in gateware what Virtex does in hardware to do DCI / calibrated delays

02:20 <mwk> except shittier

02:20 <mwk> so you have programmable drive strength, right

02:20 <azonenberg> waaait the mig wrapper partial reconfigs the io *drivers*???

02:20 <azonenberg> i thought it just did delay calibration

02:21 <mwk> the mem IP uses a pin connected to a calibration resistor to kind of try different drive strengths

02:21 <azonenberg> that is even more cursed than i remembered the one time i dared look at the generated rtl

02:21 <mwk> and when it finds one it likes, it reconfigures all DQ pins

02:21 <azonenberg> are you serious

02:21 <mwk> and does that continuously

02:21 <mwk> azonenberg: completely

02:21 <whitequark> what the fuck

02:21 <mwk> I had a link somewhere...

02:22 <mwk> oh, and s6 is the only FPGA where you actually can reconfigure the drivers at runtime

02:22 <mwk> presumably only for that exact purpose

02:23 <mwk> azonenberg: oh, here

02:23 <mwk> https://github.com/open-ephys/rhythm/blob/master/mcb_soft_calibration.v

02:23 <mwk> https://github.com/open-ephys/rhythm/blob/master/mcb_soft_calibration.v#L262

02:24 <mwk> these 8 registers correspond 1-1 to the I/O buffer tile in the bitstream

02:24 <azonenberg> my eyes...

02:24 <mwk> https://github.com/open-ephys/rhythm/blob/master/mcb_soft_calibration.v#L690

02:24 <mwk> and here's the cursed state machine

02:25 <azonenberg> also wow the code readability there is atrocious

02:25 <azonenberg> no blank lines between blocks, etc

02:25 X-Scale has joined ##openfpga

02:30 <mwk> also, the other files were an "enjoyable" read as well

02:30 <mwk> the DRP port for the IOs is also quite... "special"

02:30 <azonenberg> i mean the 7 series PLL DRP is literally poking raw bitstream bits too

02:30 <azonenberg> but that's at least somewhat documented

02:31 <mwk> unlike all the other DRP ports in Xilinx (which are dead simple parallel read/write with plain address and data buses), it is serial

02:31 <mwk> kinda SPI

02:31 <mwk> and you always have to both read and write

02:31 <azonenberg> looool it's probably bypassing the frame logic

02:31 <mwk> yes

02:31 <azonenberg> and poking raw dff's

02:31 <azonenberg> i bet they bodged it on last minute

02:31 <azonenberg> and didnt have area to spare

02:31 <mwk> but then so is the normal virtex DRP

02:31 <azonenberg> so its a giant shift reg

02:32 <mwk> correct

02:32 <mwk> but not exactly

02:32 <mwk> it's quite a complex beast

02:32 <mwk> it's crap no doubt, but not last-minute crap

02:32 <whitequark> so to get background DRP, there's 2 FFs per 1 config bit right?

02:32 <whitequark> background DR*

02:33 <mwk> so you can select whether any given IO gets its own DRP port, or is connected to big DRP bus

02:33 <mwk> with the big DRP bus connected to the hard mem controller block

02:34 <mwk> not *driven* by it, just connected to it, it just kind of passes the signal through to interconnect logic

02:34 <mwk> in the "own port" case, you shift register address, then shift register data

02:34 <mwk> in the "bus" case, you have mixed parallel/serial addressing

02:35 <mwk> you select the I/O via parallel address line, but still select the register serially

02:35 <whitequark> so they're just saving interconnect lines to not have wide buses per IOB?

02:36 <mwk> yes

02:36 <mwk> and my favorite part is this

02:36 <mwk> every I/O has a programmable address on this (parallel) bus

02:37 <mwk> and for some crazy reason, the default address programmed by bitgen for unused I/Os is 0x13 (which you have to fuzz the FPGA to know)

02:38 <mwk> which the mem controller IP carefully avoids

02:39 <mwk> the whole thing looks like mem controller had a bug which caused it to accidentally reprogram some random I/Os, and they fixed it in the bitgen by changing the address for unused I/Os to something random which the IP happened to not be using...

02:39 <azonenberg> looooooool

02:40 <azonenberg> side note, if you ever figure out the root cause of the s6 9kb ram errata

02:40 <azonenberg> i would love to see some analysis on that

02:40 <azonenberg> (-g INIT_9K)

02:40 <mwk> it's pretty self-evident

02:40 <mwk> I mean, I don't have details yet

02:40 <mwk> but more or less what happens is

02:41 <mwk> the configuration logic kind of hijacks the normal BRAM ports to upload initial data

02:41 <mwk> standard procedure on all xilinx parts (and probably all FPGAs ever)

02:41 <azonenberg> well yeah you arent gonna add a third set of ports

02:42 <mwk> but, the bug is that if the ram is in 9k mode, you're restricted to writing half of it

02:42 <mwk> because they fucked up and didn't put an override in there for config mode

02:42 <mwk> so if you look at the bitstream

02:42 <azonenberg> So the high half of the 9k never gets written?

02:42 <azonenberg> sorry i mean, the 9k's in the second half of the 18k

02:42 <azonenberg> you can init the other half?

02:42 <mwk> it uploads the normal frames, except with 9k disabled

02:42 <mwk> then uploads the blockram frames

02:42 <azonenberg> oh, that's the bugfix?

02:43 <azonenberg> then they patch the mode bit

02:43 <azonenberg> that's actually really interesting

02:43 <mwk> and then sets FAR back to the frame with the mode bit and reupload it, yes
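
The write order described above, as a toy model. Everything here is illustrative: frames are represented as sets of symbolic bit names, `RAM_9K_MODE` is an invented name for the mode bit, and the step tuples are not real configuration commands.

```python
MODE_BIT = "RAM_9K_MODE"  # hypothetical name for the 9k mode bit


def init_9k_write_plan(normal_frames, bram_frame_addrs, mode_frame_addr):
    """Return the ordered write steps for the workaround.

    normal_frames: dict mapping frame address -> set of bit names that are set.
    bram_frame_addrs: iterable of block RAM content frame addresses.
    mode_frame_addr: address of the normal frame holding the mode bit.
    """
    plan = []
    # 1. Upload the normal frames with 9k mode forced off.
    for addr in sorted(normal_frames):
        plan.append(("write_frame", addr, frozenset(normal_frames[addr] - {MODE_BIT})))
    # 2. Upload the block RAM contents while the RAM is not in 9k mode.
    for addr in sorted(bram_frame_addrs):
        plan.append(("write_bram_frame", addr))
    # 3. Point FAR back at the mode frame and rewrite it with the real bits.
    plan.append(("set_far", mode_frame_addr))
    plan.append(("write_frame", mode_frame_addr, frozenset(normal_frames[mode_frame_addr])))
    return plan
```

The final rewrite of an already-written frame is the step that, per the discussion that follows, clashes with encrypted bitstreams.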

02:43 <azonenberg> you should do a little blog article on REing s6 errata

02:43 <mwk> I don't know what exactly happens when you config-write a ram in 9k mode, any kind of corruption could happen

02:44 <mwk> and fun fact

02:44 <mwk> the errata fix conflicts with bitstream encryption

02:44 <azonenberg> because you cant do partial overwrites with crypto

02:44 <mwk> because encryption requires you to always write the whole bitstream in one go

02:44 <azonenberg> yeah i think i recall reading about that

02:45 <mwk> as in, once you do an encrypted upload, you cannot overwrite anything without bonking on PROG_B and resetting all memory

02:45 <azonenberg> Yeah

02:46 <azonenberg> i guess the nicer option would have been to allow overwrites as long as the overwritten data was also encrypted and mac'd?

02:46 <mwk> they do that on 7 series

02:46 <azonenberg> i dont think that would have broken security

02:46 <mwk> cannot do it on spartan 6, because it doesn't support MAC

02:46 <azonenberg> aaah ok that makes sense then

02:47 <mwk> also I don't know how 7 series encryption works

02:47 <mwk> but on Spartan 6 it's... kind of the wrong level to enable secure partial reconfiguration

02:47 <mwk> the only thing that is encrypted is frame data, all aux config registers are unprotected

02:48 <azonenberg> i havent looked at 7 series crypto yet

02:48 <azonenberg> i havent gone below the frame layer

02:48 <azonenberg> but if i was doing it, i'd probably go AES-GCM on entire frames

02:48 <mwk> oh, and in case you want another reason why spartan 6 is batshit insane

02:49 <mwk> it doesn't have constant-length frames

02:49 <mwk> as opposed to... well, everything else

02:49 <mwk> you have 3 frame types on 6s

02:50 <mwk> the normal frames, which are always 16×64+16 bits long, and correspond to a 16-CLB-tall slice of the bitstream

02:51 <mwk> the blockram frames, which... to be honest I'm not sure what size they are (I could be off by power of two), but definitely constant and likely the same size as normal frames

02:52 <mwk> and the single extra-special IOB frame (singular), which consists of however many bits are required by the IOBs in the device

02:52 <mwk> 64 bits per IOB plus 384 bits per IO clock tile (one per side)

02:52 <mwk> and it's kind of circling around the whole device
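
The sizes quoted above work out as follows. This is just the arithmetic from the chat; the default of four IO clock tiles assumes "one per side" means four sides, and the IOB count in the example is invented.

```python
# Normal frame: 16 tiles of 64 bits plus 16 extra bits.
NORMAL_FRAME_BITS = 16 * 64 + 16   # 1040 bits
NORMAL_FRAME_WORDS = NORMAL_FRAME_BITS // 16  # 65 words of 16 bits


def iob_frame_bits(num_iobs, io_clock_tiles=4):
    """Length of the single IOB frame for a device with num_iobs IOBs."""
    return 64 * num_iobs + 384 * io_clock_tiles
```

A hypothetical device with 100 IOBs would have an IOB frame of 64 × 100 + 384 × 4 = 7936 bits, which is why this frame's length differs from part to part.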

02:55 <azonenberg> mwk: well you know how cursed the coolrunner bitstreams are right?

02:55 <mwk> I've heard bits of that

02:55 <mwk> but haven't looked closely

02:55 <azonenberg> the 2c32a is a 48 row x 260 bit array

02:55 <azonenberg> that is literally splatted right into the physical layout of the chip

02:55 <mwk> you know what

02:55 <mwk> so are FPGAs

02:56 <azonenberg> noo you misunderstand though

02:56 <azonenberg> the generated .jed file is logically addressed

02:56 <azonenberg> so you have a block for the and array, a block for the or array, a block for macrocells, a block for routing

02:56 <azonenberg> then you have to go through a giant undocumented permutation to put the bits from that into the jtag shift register

02:56 <azonenberg> mirroring things and interleaving the various portions of the array as you go

02:57 <azonenberg> I RE'd the permutation and it maps perfectly to the die layout

02:57 <azonenberg> and it made REing the bitstreams vastly easier

02:57 <azonenberg> But why not generate a bitfile in physical address space?

02:57 <azonenberg> a jed is supposed to be something you can serialize right out to the DUT

02:57 <azonenberg> it isnt supposed to need a MMU in the way :p

02:57 <mwk> yeah

02:57 <mwk> heh

02:58 <mwk> btw, as for what's below frame level...

02:58 <whitequark> azonenberg: similar for xc9572

02:58 <whitequark> which i re'd

02:58 <mwk> it's the same kind of hell

02:58 <whitequark> *xl

02:58 <azonenberg> side note, i love this community

02:58 <azonenberg> being able to actually complain about the cursedness of fpga bitstreams *and have somebody understand*

02:58 <azonenberg> lol

02:59 <mwk> bits arranged in whatever two-dimensional patterns will route nicely to the underlying gates

02:59 <azonenberg> yeah but that makes sense

02:59 <azonenberg> the .bit file has them in that order

02:59 <mwk> consider the Virtex 6 and 7 CLBs

02:59 <azonenberg> you arent optimizing for readability of the bitstream

02:59 <mwk> they are functionally completely identical

02:59 <azonenberg> the coolrunner jeds are very readable

02:59 <azonenberg> even with comments

03:00 <mwk> same capabilities, same attributes, exact same bits

03:00 <azonenberg> and wait, v6 and 7 series clbs are the same? they didnt change anything at all in the primitive structure?

03:00 <mwk> hell, the bitstream tile has the same dimensions

03:00 <azonenberg> just re-layout?

03:00 <mwk> but... the bitstream arrangement is different

03:00 <azonenberg> well yeah

03:00 <mwk> someone spent a lot of time moving these bits around to correspond to 28nm geometry

03:00 <azonenberg> i know the s6 clb is way different

03:00 <mwk> oh, it's not way different

03:00 <mwk> just a bit different

03:01 <azonenberg> Die, SLICEX

03:01 <azonenberg> killing that was the single best thing about 7 series

03:01 <azonenberg> i can put adders and wide muxes in every slice now

03:01 <mwk> eh

03:02 <mwk> right, SLICEX

03:02 <mwk> kind of forgot it was a thing

03:02 <mwk> fun fact: the documentation of that thing is buggy

03:03 <azonenberg> oh?

03:03 <mwk> shows a mux path that isn't there (O6 -> [ABCD]MUX)

03:03 <whitequark> what's up with AR34541?

03:03 <mwk> really non-obvious, because the output obviously exists, and is routable to [ABCD]MUX in SLICEM and SLICEL

03:04 <mwk> and when you emit .xdl files because you want to do your own P&R, the xdl hw description also lists O6 as a valid choice for that mux

03:04 <mwk> but xdl will fail conversion with some non-descript error message referring to random things

03:05 <mwk> I've had a *lot* of fun figuring out the root cause, because the dumb thing never said what its problem was

03:06 <mwk> whitequark: huh, no idea; just some random bug?

03:07 <mwk> I mean, the blockram really is crap on this device

03:07 <mwk> (see the INIT_9K discussion above)

03:07 <whitequark> mm

03:07 <mwk> also, xdl allows you to specify a 16kbit SDP RAM

03:08 <mwk> but it's not mentioned anywhere in the datasheets as an allowable configuration

03:08 <mwk> my guess is it turned out broken as well

03:09 <mwk> oh, and my favorite part of spartan 6 has to be the low-power version

03:10 <mwk> which is probably the exact same chip except you power it with 1V instead of 1.2V

03:10 <mwk> and apply a hilarious amount of workarounds to make it actually work

03:12 <mwk> as in, the datasheet has a long list of features you're not supposed to use, the generated bitstreams are slightly different, and synthesis has to insert some "fix-up" circuitry to look at undocumented DCM status bits and bonk on the reset button enough times to make it lock correctly

03:14 <whitequark> *what*

03:15 <whitequark> it does *fucking what*

03:15 <mwk> https://0x04.net/~mwk/xidocs/ug/xc6s-clk.pdf

03:15 <mwk> pages 81, 82

03:15 <mwk> enjoy

03:15 <mwk> don't ask me what this does, I have no clue

03:17 <whitequark> ...

03:18 <sorear> ecp5 5g’s evil twin?

03:51 Bike has quit [Quit: Lost terminal]

03:54 <Sprite_tm> Whoa, and I thought some of the workarounds the CPU people try to sneak past the radar are funky sometimes.

03:58 m_w has joined ##openfpga

04:31 <mwk> oh heh

04:31 <mwk> and I just remembered

04:32 <mwk> on the topic of crazy circuits inserted by xilinx software

04:33 <mwk> let me find a link...

04:35 <mwk> ahh, there it is

04:35 <mwk> so you have this shiny new Virtex 2 Pro FPGA with two hard PPC cores embedded (the left core and the right core)

04:35 <mwk> and it's a -7 speed grade FPGA, too

04:36 <mwk> so you can just connect a 350MHz clock to the PPC cores and everything will be fine

04:37 <mwk> but it turns out that these PPC cores could do 400MHz, but there's a little problem with the clock circuit

04:37 <mwk> and Xilinx provides a handy little macro that you can insert in your clock path and fix it so that the left PPC core can do full 400MHz

04:38 <mwk> (the right PPC core doesn't need the macro and can always do 400MHz for some reason)

04:38 <mwk> https://www.xilinx.com/support/documentation/application_notes/xapp755.pdf

04:39 <mwk> so I've looked at the macro and... it does duty cycle distortion, for some reason the PPC core needs slightly longer 0 period than 1 period to do full 400MHz

04:40 <mwk> the macro is actually hand-placed and hand-routed, and consists of: 1 LUT1 that buffers the incoming clock, and 1 LUT2 that is just an AND gate that ands the clock with itself

04:40 <mwk> but the two inputs are routed differently; one is a direct connection, while the other is routed several columns over and back

04:41 <Sprite_tm> Huh. Instant lower duty cycle thanks to light speeds.

04:41 <mwk> effectively reducing the clock high period by the routing delay

04:41 <mwk> yup
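
A tiny discrete-time model of the trick described above: ANDing a clock with a delayed copy of itself trims the high phase by the delay. The tick counts are arbitrary and only illustrate the principle, not the real routing delay in XAPP755.

```python
def high_ticks_after_and(period, high, delay, cycles=4):
    """High ticks per period of (clk AND delayed clk).

    The clock is high for the first `high` ticks of every `period` ticks;
    `delay` is the extra routing delay on the second AND input, in ticks.
    """
    def clk(t):
        return 1 if (t % period) < high else 0

    # Start one period in so the delayed copy is already in steady state.
    samples = [clk(t) & clk(t - delay)
               for t in range(period, period * (cycles + 1))]
    return sum(samples) // cycles
```

With a 10-tick period, a 5-tick high phase and a 1-tick delay, the output is high for 4 ticks per period: the rising edge is pushed back by the delay while the falling edge stays put, so the low phase grows by the same amount.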

04:42 <Sprite_tm> Gotta give props to the engineer who went 'I think I can fix this, but it's going to be tricky...'

04:46 <mwk> certainly a clever solution

04:48 <mwk> I also particularly love how that application note never mentions anything about duty cycles

04:48 <mwk> it just says that the clock requires "special considerations"

04:49 <Sprite_tm> *deeper magic* /me handwaves... You're not supposed to understand this...

04:50 Richard_Simmons3 has joined ##openfpga

04:52 <mwk> just use our... special considerations insertion macro

04:54 Richard_Simmons has quit [Ping timeout: 264 seconds]

05:23 rohitksingh has joined ##openfpga

05:24 <sorear> I guess you could call that "light" speeds

05:41 Jybz has joined ##openfpga

05:57 mkdir has joined ##openfpga

05:57 <mkdir> huddo

05:57 <mkdir> how do i send serial data from ice40 to the comp

05:57 <mkdir> over uart - is it $display?

05:57 <mkdir> or some other way

06:12 <sensille> mkdir: erm. normally i add a uart and fifo to my design and the state machine to fill it. $display is for simulation

06:12 <mkdir> thanks sensille

06:13 <mkdir> im currently looking for tuts, but can't find anything...

06:13 <sensille> it's not the easiest way to get debugging information

06:13 <mkdir> is it hard to add uart and fifo

06:13 <sensille> if that's what you want

06:13 <mkdir> I just need to get sensor data to plot

06:13 <mkdir> is there a better way?

06:13 <sensille> so it's a regular part of your design?

06:14 <tnt> for debug, if simulation isn't possible you can export some internal signals to pins and use a logic analyzer.

06:15 <sensille> or use some internal debug/LA core

06:15 <mkdir> sensille yes

06:15 <tnt> sensille: on an ice40 it's rarely an option

06:15 <sensille> but to regularly transport data to a PC uart is fine, especially because it's easy to receive

06:16 <mkdir> cool yeah it is a regular part of my design

06:16 <mkdir> as I mentioned, I wanna run some computation and plot data

06:17 <tnt> mkdir: What board is that on ?

06:17 <mkdir> lattice ic40

06:17 <mkdir> ice40

06:17 <sensille> mkdir: i'm using this uart in my design: https://github.com/cyrozap/osdvu

06:17 <mkdir> cool what's the fifo?

06:17 <tnt> board, not chip

06:17 <sensille> (just a more or less random choice)

06:17 <mkdir> tnt icestick

06:19 <sensille> there are tons of fifos available, but i just wrote my own stupid one

06:19 <mkdir> what is the fifo used for? I'm not familiar

06:19 <mkdir> I have used the uart communication protocol before though

06:19 <sensille> so you can generate data in burst, faster than they are transmitted

06:20 <mkdir> oh I see, it's a queue

06:20 <sensille> your design might or might not need it

06:20 <sensille> yeah

06:20 <tnt> yeah, on the icestick, uart is pretty much the only option to talk back to the PC

06:20 <mkdir> alright thanks tnt

06:20 <sensille> PC or *pi?

06:20 <mkdir> PC

06:21 <mkdir> basically, for now I have a clock circuit or clock divider and I just need to send data when it switches high or low

06:21 <mkdir> so that I can plot it and see the signal

06:22 <sensille> probably enough to hold the data in a register and have a state machine that sends it to the uart

06:23 <sensille> tnt: i don't know the icestick, does it have an external uart you can use or would you implement your own in fpga?

06:26 <tnt> sensille: you need to implement it in the fpga, but it has a couple of pins connected to a FTDI
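
What the UART plus FIFO arrangement has to produce on the wire can be sketched in a few lines. This is a behavioural model in Python, not FPGA code, and it assumes the common 8N1 framing (one start bit, eight data bits LSB first, one stop bit); nothing in the discussion fixes the framing or the baud rate.

```python
from collections import deque


def uart_frame(byte):
    """Return the line levels for one 8N1 frame: start, 8 data bits LSB first, stop."""
    return [0] + [(byte >> i) & 1 for i in range(8)] + [1]


def drain(fifo):
    """Serialize every queued byte, oldest first, as a transmit state machine would."""
    line = []
    while fifo:
        line.extend(uart_frame(fifo.popleft()))
    return line


fifo = deque(b"Hi")   # a burst of data produced faster than it can be sent
bits = drain(fifo)    # 2 bytes -> 20 bit times on the TX pin
```

The FIFO only matters when data arrives in bursts; for a single slowly changing value, a holding register feeding `uart_frame` is the same idea with a queue depth of one.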

06:28 <mkdir> thanks both - I will try implementing this tomorrow

06:28 <mkdir> gn

06:28 <sensille> have fun

06:30 <mkdir> ty

06:30 <mkdir> and also ty for the link

06:30 mkdir has quit [Remote host closed the connection]

07:20 emeb_mac has quit [Ping timeout: 245 seconds]

07:33 zng has quit [Ping timeout: 245 seconds]

08:03 implr has joined ##openfpga

08:12 Jybz has quit [Remote host closed the connection]

08:39 implr has quit [Ping timeout: 268 seconds]

08:39 q3k has quit [Ping timeout: 264 seconds]

08:49 s_frit has quit [Remote host closed the connection]

08:49 s_frit has joined ##openfpga

08:58 Jybz has joined ##openfpga

09:38 implr has joined ##openfpga

10:32 _whitelogger has joined ##openfpga

10:49 m4ssi has joined ##openfpga

11:43 rohitksingh has quit [Ping timeout: 246 seconds]

11:53 rohitksingh has joined ##openfpga

11:54 <ZirconiumX> My DE-10 arrived. Hopefully it keeps its magic smoke safely contained inside, but we'll see.

12:02 Jybz has quit [Ping timeout: 252 seconds]

12:34 flea86 has quit [Quit: Goodbye and thanks for all the dirty sand ;-)]

12:35 rohitksingh has quit [Read error: Connection reset by peer]

13:25 emeb has joined ##openfpga

14:04 s_frit has quit [Remote host closed the connection]

14:04 s_frit has joined ##openfpga

14:20 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

14:22 s_frit has quit [Remote host closed the connection]

14:22 s_frit has joined ##openfpga

14:28 <mwk> whitequark: a few initial observations on ISC

14:29 <mwk> first, stuffing any of the 5 ISC opcodes (ISC_ENABLE, ISC_DISABLE, ISC_NOOP, ISC_PROGRAM, ISC_READ; presumably more for newer FPGAs) into IR instantly activates HIGHZ mode

14:29 <mwk> the FPGA keeps running, but with unconnected outputs

14:30 <mwk> further, ISC_ENABLE is kind of equivalent to JSHUTDOWN and ISC_DISABLE to JSTART, but with one extra thing

14:30 <mwk> there is this ISC_ENABLED flag in status register; ISC_ENABLE sets it once it's done with the shutdown sequence and ISC_DISABLE unsets it once it's done with startup sequence

14:31 mumptai has joined ##openfpga

14:31 <mwk> ISC_ENABLED == 1 also forces the FPGA to HIGHZ mode, so if you ISC_ENABLE and then JSTART, the FPGA goes through startup and is running, but still in HIGHZ mode until you actually do ISC_DISABLE

14:32 <mwk> oh, and you clock both of these sequences using RTI like JSTART/JSHUTDOWN

14:33 <mwk> so... ISC_PROGRAM/ISC_READ is unfortunately unsuitable for live reconfig / readback because of the HIGHZ thing :(
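
The HIGHZ rule observed above can be written down as a tiny predicate. This is a sketch of the behaviour as reported in the channel, not something taken from a datasheet; the opcode names are used as plain strings and `outputs_highz` is a hypothetical helper.

```python
# The five ISC opcodes mwk lists; newer FPGAs presumably have more.
ISC_OPCODES = {"ISC_ENABLE", "ISC_DISABLE", "ISC_NOOP", "ISC_PROGRAM", "ISC_READ"}


def outputs_highz(ir, isc_enabled):
    """Outputs are tristated if IR holds any ISC opcode or the ISC_ENABLED flag is set."""
    return ir in ISC_OPCODES or isc_enabled


# ISC_ENABLE followed by JSTART: the design runs, but the pins stay disconnected.
print(outputs_highz("JSTART", isc_enabled=True))   # True
```

Either condition alone is enough, which is why ISC_PROGRAM/ISC_READ cannot be used while the design needs its outputs.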

14:45 <mwk> further

14:45 <mwk> executing JSHUTDOWN once the device is configured doesn't clear the ISC_DONE flag (aka the EOS flag)

14:46 <whitequark> yep, seems right

14:46 <mwk> so when starting the device back up with JSTART, there is no way to know whether you're actually done just by looking at the status register

14:46 <whitequark> you reverse engineer it faster than i can write it up :S

14:47 <mwk> you can probably tell by looking at the config port STATUS register

14:47 <mwk> but if you do this via ISC_PROGRAM/ISC_READ, you enable the HIGHZ mode and lose

14:47 <mwk> and if you want to do it via CFG_IN/CFG_OUT, you have to figure out how to deal with a non-single device on the chain...

14:51 <whitequark> CFG_IN is fine

14:51 <whitequark> there's a noop instruction right

14:52 dh73 has joined ##openfpga

14:53 <mwk> well yes

14:53 <whitequark> 2000...

14:53 <whitequark> wait, they go in MSB first

14:54 <whitequark> so the FPGA can be... only the second in the scan chain?

14:54 <mwk> but if you want to use it to read back a config port register, you have to synchronize it and figure out how to deal with extra bits you have to pad

14:54 <mwk> yes

14:54 <mwk> the board I'm using has exactly that

14:54 <whitequark> what does 0000 do?

14:55 <whitequark> i know iMPACT uses the 0000 command

14:55 <whitequark> or whatever it is

14:55 <mwk> it is AVR programmer -> xilinx PROM -> FPGA -> AVR

14:55 <mwk> hmm, or is it FPGA -> PROM

14:55 <mwk> doesn't matter; I've seen both configurations already

14:56 <mwk> 0000 is a noop

14:56 <whitequark> oh then this isn't a problem

14:56 <mwk> isn't it?

14:56 <whitequark> because BYPASS is loaded with 0

14:56 <mwk> yes, but

14:56 <mwk> you still have to time the CFG_IN -> CFG_OUT change right, I think

14:57 <whitequark> you have to shift 16-prefix zeroes in at the start

14:57 <whitequark> and shift 16-suffix more dummy bits out of CFG_OUT

14:57 <whitequark> that's not so hard

14:57 <whitequark> but it requires making the TAPInterface abstraction in Glasgow a bit more leaky

14:57 <whitequark> but that's fine I guess

14:57 <mwk> yes

14:58 <mwk> unless you have more than 16 devices you have to BYPASS in front / behind it

14:58 <mwk> though I suppose if you have 16-device scan chains you're already in hell

14:59 <whitequark> x=16-prefix; while x<0: x+=16
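
whitequark's one-liner, spelled out as a function. It assumes `prefix` is the number of single-bit BYPASS registers ahead of the FPGA in the chain and that the goal is to bring the total up to a multiple of the 16-bit noop word; `pad_bits` is a hypothetical name.

```python
def pad_bits(prefix, word=16):
    """Number of zero bits to shift so that prefix + padding is a multiple of word."""
    x = word - prefix
    while x < 0:
        x += word
    return x


for prefix in (0, 1, 16, 17, 40):
    pad = pad_bits(prefix)
    # The padded total always lands on a word boundary.
    assert (prefix + pad) % 16 == 0
    print(prefix, pad)
```

For `prefix == 0` this returns 16 rather than 0, that is, one whole extra noop word; `(-prefix) % word` would give the minimal padding instead, and both keep the alignment.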

14:59 <mwk> hmm

15:00 <mwk> oh, you're right

15:00 <mwk> of course

15:00 <mwk> the only potential issue is whether the extra read cycles you perform via CFG_OUT will be a problem

15:00 <whitequark> yes

15:00 <whitequark> i *think* it's not actually an issue

15:00 genii has joined ##openfpga

15:00 <mwk> yeah, shouldn't be

15:00 <mwk> alright

15:00 <mwk> let's try out some shit

15:01 <mwk> *sigh* so what the fuck is ipython doing

15:01 <mwk> In [284]: byterev

15:01 <mwk> Out[284]: b'\x00\x80@\xc0 \xa0`\xe0\x10\x90P...

15:01 <mwk> In [285]: bytes(byterev[x] for x in b'abcd')

15:01 <mwk> NameError: name 'byterev' is not defined

15:02 <mwk> does IPython.embed() give you some kind of fucked up class scope...
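
A sketch of the table mwk is using, plus one way the NameError can arise. Python documents that `exec()` with separate globals and locals runs code as if it were in a class body, so a generator expression cannot see names that live only in the locals; whether `IPython.embed()` hits exactly this path is an assumption. `bytes.translate()` avoids the nested scope altogether.

```python
# Table mapping each byte to its bit-reversed value (0x01 -> 0x80, 0x02 -> 0x40, ...).
byterev = bytes(int(f"{i:08b}"[::-1], 2) for i in range(256))

# bytes.translate() does the table lookup without a generator expression.
reversed_abcd = b"abcd".translate(byterev)


def genexpr_sees_local():
    """Run mwk's expression with separate globals and locals dicts."""
    src = "result = bytes(byterev[x] for x in b'abcd')"
    try:
        # byterev is only in the locals dict, so the genexpr's own scope misses it.
        exec(src, {}, {"byterev": byterev})
    except NameError:
        return False
    return True


print(byterev[:11])            # b'\x00\x80@\xc0 \xa0`\xe0\x10\x90P'
print(reversed_abcd)           # b'\x86F\xc6&'
print(genexpr_sees_local())    # False
```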

15:04 <whitequark> mwk: also they mention some sort of "sync word"

15:04 <whitequark> oh, aa995566

15:04 <mwk> yes

15:04 <whitequark> yeah i'm guessing you can actually shift absolutely whatever into CFG_IN *if* you go through TLR

15:05 <mwk> hmm

15:05 <mwk> how?

15:06 <whitequark> i think it ignores everything before aa995566

15:06 <mwk> yes, that is correct

15:06 <whitequark> fun thing to test: does it even need you to maintain byte boundaries?

15:06 <mwk> nope

15:06 <whitequark> then you don't need to care about the devices before it in the chain at all

15:06 <whitequark> just go through TLR

15:07 <mwk> hmm, does TLR desync the config machine?

15:07 <whitequark> the docs seem to imply that

15:07 <mwk> that does make sense

15:07 <whitequark> oh, you sort of do need to care, i think

15:07 <whitequark> to push the tail of the configuration in

15:07 <whitequark> but that's easier

15:07 <whitequark> oh, and that wouldn't even violate the abstraction, right?

15:07 <mwk> right, you just pad it with extra nops

15:07 <whitequark> because you already shift in the suffix

15:07 <whitequark> no, you don't need extra nops

15:07 <mwk> oh

15:07 <mwk> right

15:07 <whitequark> you already shift in 0 there

15:08 <mwk> yes, of course

15:08 <whitequark> and actually, no

15:08 <whitequark> you don't need extra nops because the device won't see those extra bits

15:08 <whitequark> they'll get stuck in BYPASS before it

15:08 <mwk> correct

15:09 rohitksingh has joined ##openfpga

15:09 <whitequark> btw re our convo earlier about ISC_PROG

15:10 <whitequark> i looked at the SelectMAP interface

15:10 <whitequark> it is *also* bit-reversed per byte regardless of width

15:10 <whitequark> so i *think* the ISC_PROG register will be bit-reversed per byte regardless of its width

15:10 <mwk> that's just Xilinx numbering shit from MSB though, isn't it?

15:10 <whitequark> but not sure

15:10 <whitequark> it has a discontiguity in the middle

15:11 <mwk> but yes, if the .isc file bitgen spit out is correct, it's going to be bit-reversed

15:11 <mwk> hm no

15:11 <whitequark> per byte or per word?

15:11 <mwk> .isc is word bit-reversed

15:11 <whitequark> hm wtf

15:12 <whitequark> have you tried programming it yet

15:14 * daveshah

15:14 <mwk> no, I'm trying to get my tools to stop being shit first

15:14 <mwk> also I'm testing a different thing right now

15:14 <daveshah> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

15:14 <daveshah> +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

15:14 <daveshah> crap sorry

15:15 <daveshah> shouldn't have been trying to fix keyboard while reading irc

15:15 <mwk> that never ends well

15:17 <mwk> what.

15:18 <mwk> so now I'm shifting in CFG_IN and *not* discarding the TDO output

15:18 rohitksingh has quit [Ping timeout: 258 seconds]

15:18 <mwk> it seems to be spewing out random crap, unrelated to what I put in

15:19 <mwk> hm, or are my tools being shit again

15:21 <whitequark> it might be floating

15:22 <whitequark> lemme check with glasgow

15:22 q3k has joined ##openfpga

15:22 <mwk> okay, so what happens is

15:23 <mwk> CFG_IN's TDO is whatever was on CFG_IN's TDI 64 cycles ago

15:23 <mwk> (persisted across IR changes including JPROGRAM)

15:23 <whitequark> iiinteresting

15:23 <mwk> which matches up with something I recall reading in one of the config guides, not sure for which FPGA

15:24 <whitequark> ganged configuration?

15:24 <mwk> which said that you have to align config packets going into JTAG to 64 bits, not 32 as should be expected

15:24 <mwk> and if you're doing readback, you may or may not have to insert a 32-bit nop, depending on where you are in the 64-bit alignment

15:25 <mwk> this is... not something I'm happy about

15:25 <mwk> (not sure what the equivalent numbers are for xc6s)

15:25 <mwk> whitequark: nah, I think there just is a 64-bit shift register which controls the packet parser

15:26 <mwk> and the bits falling off the end are kind-of an accident
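
The observed behaviour (TDO shows whatever was on TDI 64 cycles earlier) is what a plain 64-bit shift register would do. A minimal model, assuming nothing resets the register between shifts; `CfgInShiftModel` is a hypothetical name and says nothing about how the packet parser taps the register.

```python
from collections import deque


class CfgInShiftModel:
    """Fixed-length shift register: each clock shifts TDI in and the oldest bit out."""

    def __init__(self, length=64):
        self.bits = deque([0] * length, maxlen=length)

    def clock(self, tdi):
        tdo = self.bits[0]        # oldest bit falls off the end onto TDO
        self.bits.append(tdi)     # maxlen drops that oldest bit automatically
        return tdo


def shift(model, tdi_bits):
    """Clock a sequence of bits through the model and collect TDO."""
    return [model.clock(b) for b in tdi_bits]


model = CfgInShiftModel()
pattern = [(i * 5 + 1) % 3 == 0 for i in range(128)]
out = shift(model, [int(b) for b in pattern])
# The first 64 TDO bits are the initial contents; the rest is the input delayed by 64.
```

Because the model keeps its state between calls to `shift`, stale bits from an earlier operation come out first, which would look like "random crap, unrelated to what I put in".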

15:28 <whitequark> lol

15:31 <mwk> Configuration Register (Boundary-Scan)

15:31 <mwk> The configuration register is a 64-bit register. This register allows access to the

15:31 <mwk> configuration bus and readback operations.

15:31 <mwk> very helpful, user guide, very helpful

15:32 <whitequark> wait

15:32 <whitequark> on xc6s it's 16-bit

15:32 <mwk> ah, found it

15:33 <whitequark> on xc3e it's 64-bit *but* the ISC_PROG is 32bit?

15:33 <mwk> https://0x04.net/~mwk/xidocs/ug/xc2v-ug.pdf page 321

15:33 <whitequark> hwaet?

15:33 m4ssi has quit [Remote host closed the connection]

15:33 <mwk> All packet headers pass through a 64-bit buffer before reaching the packet processor; the

15:33 <mwk> packet processor itself interprets all commands in 32-bit words. To flush the last command

15:33 <mwk> in a sequence from the packet buffer, a sequence of configuration commands must end

15:33 <mwk> with least four 32-bit NOOP commands. Typically, this is found at the end of a bitstream, as

15:33 <mwk> shown in Table 4-18. JTAG read operations must consist of an even number of words in the

15:33 <mwk> command sequence.

15:33 <mwk> so

15:33 <mwk> the bitstream is 32-bit word oriented, but there's a 64-bit buffer along the way

15:34 <mwk> whitequark: the xc6s bitstream is 16-bit word oriented, that's for sure

15:34 <whitequark> ah

15:34 <mwk> but who knows what kind of buffers are there

15:34 <mwk> and xc6s config documentation is shit compared to xc2v, because you're not actually supposed to reconfigure spartans

15:34 <whitequark> "four NOOPs to clear the packet buffer"

15:34 <whitequark> so it's a 64-bit buffer too

15:35 <mwk> plausible

15:35 <whitequark> lemme try to measure DR length

15:38 <mwk> ISC_PROGRAM is 32 bits long though

15:39 <azonenberg> mwk: oh yeah this was the part of ug380 i found bugs in

15:41 <mwk> so how does this work

15:41 <mwk> 64-bit shift register, that much is clear

15:42 <azonenberg> With SLR-based FPGAs its even more fun

15:42 <mwk> and every 64 bits, there is a strobe that pokes packet processor, presumably

15:42 <azonenberg> i dont think i ever fully got that debugged

15:42 <mwk> with the strobe getting initially aligned when it finds the sync word

15:42 <azonenberg> but basically the SLR FPGAs are several dies with TDI/TDO ganged

15:42 <azonenberg> except all but one die are in a slave mode with no idcode or bypass register

15:43 <mwk> now how does this relate to normal JTAG capture/load flow

15:43 <azonenberg> So you get a m* wide IR and have to splat the IR out to all of the dies

15:43 <mwk> *sigh* it probably doesn't

15:44 <mwk> azonenberg: ... and there's a special IR value that *does* include the other dies in the DR chain, yes

15:44 <mwk> it's "fun"

15:44 <mwk> so have you actually poked at one?

15:44 <azonenberg> I have a pair of vu9p's

15:44 <azonenberg> vcu118s

15:45 <azonenberg> i played with it but dont think i got them to boot fully yet

15:45 <azonenberg> need to get back to that soon

15:45 <mwk> I only had the pleasure of looking at a bitstream for one of those

15:45 <mwk> it's kind of batshit

15:45 <mwk> so it configures the first die, starts it up, desynchs the config interface

15:45 <azonenberg> yes

15:46 <azonenberg> I call that extra instruction "SLR_BYPASS"

15:46 <mwk> and then goes "ah, changed my mind", resyncs config, SHUTDOWNs the first die

15:46 <mwk> and uses a secrit command to enable bitstream forwarding to the *second* die

15:46 <whitequark> wtf

15:46 <mwk> then it starts the second die

15:46 <azonenberg> There's also an ISC_NOOP in there somewhere that i dont fully understand the point of

15:47 <azonenberg> not mentioned in the datasheet but the generated bitstreams have it

15:47 <mwk> then *again* desynchs it, resynchs it, and configures the third die

15:47 <mwk> I think the startup sequence doesn't actually finish because DONE is still being pulled low by the other dies

15:47 <whitequark> i remember that ganged tdi/tdo is a common arrangement in die stacks

15:47 <mwk> but still, what

15:47 <azonenberg> whitequark: it isnt fully ganged

15:47 <whitequark> i mean

15:48 <azonenberg> because IR is 6*ndies bits wide

15:48 <whitequark> cjtag explicitly calls this out as a reason jtag is bad for multidie

15:52 <azonenberg> anyway i want to get this working once i set up the virtex boards in the new lab

15:52 <azonenberg> which is a few days out probably

15:52 <azonenberg> the big holdup right now is getting fiber pulled to the lab benches so i can get 10/40GbE from them to the core switch

15:52 <azonenberg> and the fiber is taking its sweet time to ship since its all custom lengths

15:55 <mwk> and *why*

15:55 <mwk> I mean, it'd be so simple to just make the thing visible as 4 devices on the chain

15:56 <mwk> but it has to pretend to be one device with a single-bit BYPASS register and all

15:56 <daveshah> I guess the whole marketing image is a single "3D FPGA" not four cheaper FPGAs and some superglue

15:56 <whitequark> lmao

15:57 <mwk> right, "3D"

15:57 <mwk> my ass

15:57 <mwk> it's the same story with Virtex 2/4/5 PPC cores btw

15:57 <mwk> they have their IRs in series with the FPGA one

15:58 <mwk> but they pretend to be one device with the FPGA skipping the PPCs in the DR chain unless you load the special "PPC bypass" opcode

15:58 <azonenberg> meanwhile zynq is sane

15:58 <azonenberg> in this regard

15:58 <whitequark> mwk: wait. what

15:58 <azonenberg> its an arm soc with an fpga bolted on and looks like that over jtag

15:58 <mwk> though I guess they have a better rationale for the PPCs

15:58 <whitequark> *skipping* the PPCs in the DR chain?

15:59 <mwk> because the JTAG chain is actually connected to the PPCs in soft logic

15:59 <mwk> and it would be a shame if putting the FPGA in programming mode severed the JTAG chain

15:59 <mwk> maybe that's the precedent for the SLR bullshit

16:00 <azonenberg> meanwhile zynq has a tap for the FPGAs and a config strap to specify whether the arm should be in series with that tap

16:00 <azonenberg> or routed out to fpga fabric

16:00 <azonenberg> in the latter case you can jtag the arm via any four fpga gpios or a soft jtag master in the fpga fabric

16:02 <mwk> I guess that's because zynq is a CPU with an FPGA bolted on

16:02 <mwk> while Virtex 5 is an FPGA with a CPU bolted on

16:05 <mwk> whitequark: it's a clever thing really

16:05 <mwk> so the issue is that you want these PPC cores to be JTAGable by either fabric or external master

16:06 <mwk> and the cores export their JTAG pins to the fabric as ordinary interconnect i/o

16:06 <whitequark> mwk: no, i get that

16:06 <whitequark> i mean

16:06 <mwk> then, you have this JTAGPPC singleton primitive which you can connect to these if you want to hook them up to FPGA JTAG

16:06 <nats`> guyz guyz....

16:06 <whitequark> oh, nvm, i misread

16:06 <nats`> please let the dead alone

16:06 <whitequark> it makes total sense

16:07 <nats`> RIP V2P V4P and V5

16:07 <whitequark> misread if as else

16:07 <whitequark> if as unless*

16:07 <mwk> and to avoid things getting screwy, how it works is

16:07 <daveshah> nats`: this kickstarter doesn't agree

16:07 <mwk> for DR, the TDO is only connected to FPGA's TDO if you shift in the PPC bypass IR

16:07 <daveshah> https://www.kickstarter.com/projects/1962283735/novapi-np01-a-stackable-virtex-5-fpga-hat-for-raspberry-pi/description

16:07 <nats`> ....

16:08 <nats`> oO

16:08 <mwk> and for IR, the FPGA handles it on its own, it has a 14-bit shift register (6 bits for FPGA, 8 bits for PPCs)

16:08 <nats`> what a loss

16:08 <mwk> and it mirrors the IR for PPCs into JTAGPPC's TDI

16:08 <mwk> so if PPC JTAG is broken, it doesn't break the IR chain

16:08 <nats`> I worked with those monster and I'll not get back from Serie 7 and US.... (unless for S6 if needed)

16:09 <whitequark> mwk: so, i measured CFG_IN/CFG_OUT on xc6s

16:09 <whitequark> it's 16 bit

16:10 <mwk> what

16:10 <whitequark> D: g.applet.interface.jtag_probe: JTAG-H: scan dr length=16 data=<0000000000000000>

16:10 <whitequark> don't ask me lol

16:10 <mwk> so the datasheet lies?

16:10 <mwk> I mean

16:10 <mwk> about the 4 words alignment

16:10 <whitequark> i... have no idea?

16:10 <mwk> sigh

16:11 <whitequark> my brain hasn't booted up yet so i can't check it

16:11 <mwk> there could still be buffering, just somewhere else

16:11 <azonenberg> daveshah: why the hell would you use a virtex 5 in this day and age

16:11 <azonenberg> an artix7 is going to be faster, cheaper, more power efficient, and more

16:11 <mwk> cool hat

16:11 <daveshah> I can only assume someone has a big pile of them

16:11 <nats`> 25 daveshah

16:11 <nats`> that's a small pile

16:11 <azonenberg> mwk: and yeah i suspect it's a 4 word fifo that strobes every 16 tck cycles?

16:11 <nats`> but still a pile

16:15 <nats`> want a rpi style fpga system.. https://shop.trenz-electronic.de/en/TE0726-03M-ZynqBerry-Zynq-7010-in-Raspberry-Pi-form-factor

16:16 <emeb> I've gotten a lot of use out of a MiniZed I bought last year. Cheap, plenty of I/O, reasonably good support.

16:17 <nats`> yep

16:17 <nats`> using the microzed on my side

16:17 <nats`> integrated on a custom carrier :)

16:17 <emeb> built some I/O boards for it (Arduino headers, PMODs) and did some SDR experiments.

16:18 <ZirconiumX> I'm kinda curious why the MiSTer project picked the DE-10 Nano as a base platform

16:19 <emeb> 6 vs 1/2dozen - just depends on what the originators were familiar with. Terasic stuff is reasonably priced.

16:21 <ZirconiumX> True I suppose

16:21 <emeb> I was working a contract for some DSP stuff a few years ago and we were choosing an FPGA SoC platform. At that time the Cyclone boards from Terasic were a lot easier to find and cheaper than the Zynq based stuff.

16:22 <ZirconiumX> That's fair

16:22 <emeb> That was before Digilent released the first Zybo, so the only things you could get for Zynq were the full-sized Zedboards and they were in the $500 range...

16:23 <ZirconiumX> I think the DE-10 will probably be the initial target board for Mistral

16:23 <ZirconiumX> ~~simply because I have one~~

16:24 <emeb> Yep.

16:24 <emeb> Now of course you can get fully featured Z-Turn Lite boards for ~$60.

16:25 <emeb> Only downside to those is the I/O is on high-density connectors that are a bit harder to use than PMODs.

16:27 <mwk> hmm wtf

16:27 <mwk> ISC_PROGRAM is 32 bits long

16:27 <mwk> but ISC_READ is.. 69 bits long

16:28 <whitequark> nice

16:28 <mwk> certainly, but wtf

16:28 <whitequark> uh... valid, valid_hi, valid_lo, word_hi, word_lo?

16:29 <mwk> and ISC_NOOP is 5 bits long

16:35 <whitequark> mwk: oh btw

16:35 <whitequark> ISC_NOOP selects "ISC_CONFIG"

16:35 <whitequark> same as ISC_ENABLE and ISC_DISABLE

16:35 <whitequark> according to bsdl

16:35 <mwk> yes

16:35 <whitequark> wait

16:35 <whitequark> wait, no

16:35 <whitequark> it selects "ISC_DEFAULT"

16:35 <mwk> it seems IEEE 1532 defines a semi-standard status register

16:35 <whitequark> which is the same size as ISC_CONFIG

16:35 <whitequark> ah

16:35 <mwk> which ISC_NOOP accesses

16:36 <mwk> and is *also* chained in front of ISC_READ (which explains a bit) and in front of ISC_PROGRAM (which seems to be a lie)

16:37 <whitequark> right

16:37 <mwk> ISC_ENABLE and ISC_DISABLE are also 5 bits long FWIW

16:37 <mwk> this matches

16:37 * mwk continues reading

16:44 rohitksingh has joined ##openfpga

16:47 <mwk> hrm, wonderful specification

16:47 <mwk> doesn't actually specify what the status register *is*

16:47 <mwk> it just says there may or may not be one

16:49 <mwk> it doesn't actually seem to specify much at all

16:52 <whitequark> lol

16:54 <whitequark> mwk: so i'm doing something cursed

16:54 <whitequark> clock in JPROGRAM, then watch the IR while shifting in BYPASS

16:54 <whitequark> so that it loads from the memory

16:54 <mwk> you know what, fuck that

16:55 <mwk> fuck the standard and fuck the bsdl file

16:55 <mwk> I'll just blackbox this, because both of them seem to be made of 70% lies

17:01 <mwk> so there is some sort of ISC status register indeed

17:02 <mwk> 0b00000 when in unprogrammed state, 0b01110 when in unprogrammed + ISC_ENABLED state

17:02 <mwk> and when you read ISC_READ, the low 5 bits are the ISC status register

17:03 <mwk> also, apparently the high 64 bits of ISC_READ are writable

17:03 <whitequark> but why

17:03 <mwk> *shrug* shift register

17:04 <mwk> in some circumstances it seems to overwrite half of my value (32 bits) with 0s, presumably because I triggered a read somehow

17:04 zino has quit [Ping timeout: 272 seconds]

17:04 <whitequark> oh wait

17:04 <whitequark> is the high part of ISC_READ the command and the low part the result?

17:04 <mwk> I... don't think so

17:04 <mwk> but could be

17:05 <mwk> also that would be "high high" part

17:05 <mwk> it's a 32-32-5-bit register

17:05 <whitequark> low, high, and high high?

17:05 <mwk> I guess we should settle for high, low, and status
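
The 69-bit ISC_READ layout agreed on here (high word, low word, 5-bit status) can be unpacked as below. The bit numbering is an assumption: bit 0 is taken to be the least significant status bit, following "the low 5 bits are the ISC status register"; which end of the register reaches TDO first is not modelled.

```python
def split_isc_read(value):
    """Split a 69-bit ISC_READ value into (high, low, status)."""
    if not 0 <= value < (1 << 69):
        raise ValueError("ISC_READ is 69 bits wide")
    status = value & 0x1F                 # bits 4..0
    low = (value >> 5) & 0xFFFFFFFF       # bits 36..5
    high = (value >> 37) & 0xFFFFFFFF     # bits 68..37
    return high, low, status


# 0b01110 is the status mwk reports for unprogrammed + ISC_ENABLED.
raw = (0x11223344 << 37) | (0xAABBCCDD << 5) | 0b01110
print([hex(field) for field in split_isc_read(raw)])
# ['0x11223344', '0xaabbccdd', '0xe']
```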

17:06 zino has joined ##openfpga

17:06 <mwk> alright, let's actually try to read something

17:06 <whitequark> "high high" is what the designers of this interface were

17:16 <mwk> another random observation: JPROGRAM does not deassert ISC_ENABLED

17:17 <whitequark> yes, xc3sprog does ISC_ENABLE; JPROGRAM

17:17 <whitequark> for no very good reason

17:17 <whitequark> other than reading ISC_DNA perhaps

17:18 <mwk> IEEE 1532 says you're not supposed to shift in any non-ISC command without doing ISC_DISABLE by the way

17:18 <mwk> guess that's another thing no one cares about

17:19 <whitequark> of course

17:20 <mwk> arrrgh

17:20 <mwk> and if you JPROGRAM and then try to ISC_DISABLE

17:21 ZipCPU has quit [Quit: ZNC 1.6.4 - http://znc.in]

17:21 <mwk> then you've just executed a successful startup sequence with no bitstream and of course DONE goes high on this device

17:26 <whitequark> mwk: on xc6s, ISC_ENABLE plus clocking RTI gets DONE=0 ISC_DONE=1

17:26 <whitequark> i'm confused

17:26 <whitequark> wtf is ISC_DONE?

17:27 <mwk> EOS I think

17:27 <mwk> that's consistent with my results too

17:27 <whitequark> hm, yes, if I JPROGRAM, it goes low

17:27 <mwk> I don't know if it can be pulled low short of JPROGRAM

17:32 rohitksingh has quit [Ping timeout: 245 seconds]

17:33 ZipCPU has joined ##openfpga

17:35 ZipCPU|Alt has joined ##openfpga

17:36 <mwk> my attempts at programming the FPGA via ISC_PROGRAM are failing miserably

17:39 zino has quit [Excess Flood]

17:39 rohitksingh has joined ##openfpga

17:39 zino has joined ##openfpga

17:40 ZipCPU|Alt has quit [Quit: Cap'n! The dilithium crystals are ...]

17:47 <mwk> aha!

17:48 <mwk> got it

17:48 <mwk> you have to load 32 bits per shift

17:48 <mwk> and do an RTI cycle after every 32 bits

17:48 <mwk> no RTI == no worky

17:50 <mwk> any non-0 amount of RTI cycles is acceptable, matter of fact

17:54 <mwk> using ISC also requires you to maintain word alignment

17:54 <mwk> if you shift the bitstream by one byte, it no longer configures

17:55 <mwk> presumably because it doesn't use the "serial config mode" deserializer

17:55 <whitequark> ooooh

17:56 <whitequark> so what is the bit order for ISC_PROGRAM?

17:56 <mwk> same as for CFG_IN...

17:56 <whitequark> so, bit reversed bytes?

17:56 <whitequark> 4 at a time?

17:56 <mwk> you just have to chop it into 32-bit units

17:56 <mwk> yes
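
A sketch of the data preparation described here: bit-reverse each byte as for CFG_IN, then cut the stream into 32-bit units, one per DR shift. `isc_program_words` is a hypothetical helper; the order in which each unit's bytes and bits go onto TDI depends on the JTAG tool and is not settled by this exchange.

```python
# Per-byte bit-reversal table, the same mapping used for CFG_IN.
BYTEREV = bytes(int(f"{i:08b}"[::-1], 2) for i in range(256))


def isc_program_words(bitstream):
    """Return the bit-reversed bitstream as a list of 4-byte units."""
    if len(bitstream) % 4:
        # ISC needs word alignment; a stream shifted by one byte no longer configures.
        raise ValueError("bitstream length must be a multiple of 4 bytes")
    rev = bitstream.translate(BYTEREV)
    return [rev[i:i + 4] for i in range(0, len(rev), 4)]


words = isc_program_words(bytes.fromhex("aa995566" "20000000"))
# Each entry is one ISC_PROGRAM DR shift, to be followed by at least one
# Run-Test/Idle clock before the next one.
print([w.hex() for w in words])   # ['5599aa66', '04000000']
```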

17:56 <whitequark> hrm

17:56 <whitequark> why are they bit reversed, i wonder

17:56 <whitequark> stupid ass behavior

17:56 <whitequark> oh right

17:56 <whitequark> SPI flashes...

17:56 <mwk> it's inconsistent with sanity

17:56 <mwk> but consistent with CFG_IN

17:58 <whitequark> Xilinx® All Inconsistent With Sanity

17:58 <whitequark> oh oh i know what "IS" in "ISC" means now.

17:58 <mwk> heh

17:59 <mwk> alright, so I can program shit with ISC

17:59 <mwk> that's... not particularly useful given that CFG_IN is less hassle

18:00 <mwk> for a full bitstream, at least

18:00 <mwk> so let's try reading now

18:05 <mwk> it seems you also have to clock RTI with ISC_READ to actually read shit

18:18 rvense has quit [Quit: leaving]

18:21 zino has quit [Ping timeout: 248 seconds]

18:31 rohitksingh has quit [Ping timeout: 272 seconds]

18:32 <whitequark> right

18:33 <mwk> alright, finally

18:33 <mwk> got that damn readback

18:34 zino has joined ##openfpga

18:34 <mwk> ISC_PROGRAM <- sync word + read IDCODE + two NOPs [with RTI after each word]

18:35 <mwk> then load ISC_READ, RTI, and read from DR, the (reversed) readback value is in the low word

18:38 <whitequark> nice

18:38 <whitequark> what about the high word

18:38 <mwk> *shrug*

18:38 <mwk> I'll try reading several words and see what happens

18:41 <mwk> alright

18:42 <mwk> so you always read 2 words at a time

18:42 <mwk> if I submit a "read 5 words from IDCODE" packet

18:43 <mwk> and start ISC_READing, I get: 2× (IDCODE, IDCODE), 1× (0, IDCODE), inf× (0, 0)
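
The pattern above is consistent with each ISC_READ pulling two words from the readback stream, low word first, and padding with zeros once the stream runs dry. A toy model under that assumption; the tuples are taken to be (high, low), which the log does not state explicitly, and the IDCODE value is a placeholder.

```python
def isc_read_pairs(words, reads):
    """Model `reads` ISC_READ operations over a queue of pending readback words."""
    queue = list(words)
    out = []
    for _ in range(reads):
        # Low word is filled first; missing words read back as zero.
        low = queue.pop(0) if queue else 0
        high = queue.pop(0) if queue else 0
        out.append((high, low))
    return out


IDCODE = 0x12345678   # placeholder, not a real device ID
for pair in isc_read_pairs([IDCODE] * 5, reads=4):
    print(tuple(hex(w) for w in pair))
```

With five queued words this gives two full pairs, one pair with a zero high word, and zeros from then on.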

18:44 <mwk> also: probably unsurprising, but ISC_PROGRAM/READ doesn't work without lighting ISC_ENABLED

18:53 <mwk> again I don't understand something

18:53 <daveshah> Hehe, just realised some of the commands in ECP5 bitstreams are actually 1532 commands (e.g. the command to set usercode is called ISC_PROGRAM_USERCODE and also defined thus in the BSDL file)

18:54 <mwk> so as long as IR has any ISC_* command in it, the outputs are in HIGHZ

18:54 <mwk> but, ISC_DISABLE is also effectively JSTART

18:55 <mwk> so as you go through the startup sequence, you enable GWE, and the clocked logic inside FPGA goes active

18:55 <mwk> but your outputs are still disconnected for as long as it takes the programmer to notice that ISC_ENABLED went low and change IR to BYPASS or whatever

18:56 <mwk> this... won't work all that well for lots of circuits

19:02 <mwk> *sigh* confirmed, the design is already running and blinking LEDs while opcode is still ISC_DISABLE

19:03 <mwk> and the LEDs are disconnected

19:04 <mwk> can I just conclude that ISC is a steaming pile of shit?

19:05 <ZirconiumX> Can't you conclude that for most things?

19:15 <mwk> daveshah: so how does this work? do the packet processor and JTAG share some of the command set on ECP5?

19:15 <daveshah> Yup

19:15 <mwk> because they're almost completely separate on xilinx

19:16 <mwk> JSTART/JSHUTDOWN being the notable exceptions

19:17 <daveshah> There's also a command LSC_BITSTREAM_BURST that fires TDI directly into the bitstream packet processor (I think similar to CFG_IN in Xilinx?)

19:17 <mwk> that sounds like CFG_IN, yes

19:18 <daveshah> Incidentally, someone recently hit an interesting problem where the chip failed to program if you had a SPI mode command (e.g. to enable QSPI in a flash bitstream) in the bitstream given to that instruction

19:24 <mwk> does... does it enable quad-JTAG in this case?

19:24 <mwk> that would be hilarious

19:24 <daveshah> Alas not

19:24 <daveshah> Just sets various meaningless error bits in the status register

19:25 <daveshah> Although perhaps it is actually trying to talk to the flash

19:25 <daveshah> I should check that

19:26 <whitequark> quad-JTAG

19:27 <whitequark> i'm not sure if i love or hate it

19:27 <whitequark> it's certainly disgusting

19:31 <sorear> does it use both edges of TCK?

19:35 <whitequark> sorear: aaaaaaa

19:40 <daveshah> https://patents.google.com/patent/US7725791

19:40 <azonenberg> {TDI, TDO, TMS, TRST} on rising and falling tck edges

19:40 <azonenberg> for 8x the jtag bandwidth :D

19:43 <whitequark> daveshah: no. no no no no

19:48 <mwk> daveshah: aaaaaaaaaaa

19:49 <tnt> mwk: well, did you see spy-bi-wire ?

19:50 <tnt> it's serialized tms/tdi/tdo in 3 'clk' timeslots.
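
A rough sketch of the time-slot scheme tnt describes: each JTAG cycle is split into a TMS slot, a TDI slot and a TDO slot on one shared data line. The function name `sbw_slots` and the `device_tdo` callback are assumptions made for this example, and real Spy-Bi-Wire timing and pin handling are not modelled.

```python
def sbw_slots(tms_bits, tdi_bits, device_tdo):
    """Serialize JTAG cycles into three time slots each (illustrative only).

    device_tdo is a hypothetical callback standing in for the target;
    it receives (tms, tdi) and returns the TDO bit for that cycle.
    """
    slots = []
    tdo_bits = []
    for tms, tdi in zip(tms_bits, tdi_bits):
        slots.append(("TMS", tms))       # slot 1: host drives TMS
        slots.append(("TDI", tdi))       # slot 2: host drives TDI
        tdo = device_tdo(tms, tdi)       # slot 3: host releases the line,
        slots.append(("TDO", tdo))       #         device drives TDO
        tdo_bits.append(tdo)
    return slots, tdo_bits


# Two JTAG cycles against a fake target that simply echoes TDI.
slots, tdo = sbw_slots([1, 0], [0, 1], lambda tms, tdi: tdi)
print(slots)
print(tdo)
```

Two JTAG cycles become six slots, which is the "3 'clk' timeslots" cost per bit mentioned above.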

19:50 <whitequark> azonenberg: nonono, that's too straightforward

19:50 <whitequark> you have to sample TMS only on like every 2nd edge

19:51 <whitequark> and TRST would be totally asynchronous and with some annoying setup/hold constraint to TCK

19:51 <whitequark> and there should be like 5 "modes" and which one you can use you only learn after interrogating the device in the slowest one

19:51 <mwk> I think every 4th clock should be TMS

19:51 <mwk> and every 16th TRST

19:58 <davidc__> heh, I've seen a debug protocol that manages to combine the worst of both worlds between I2C and a synchronous bus

19:58 <davidc__> (device originates "debug clock". Debug IO pin is open drain)

19:59 <davidc__> Periodic framing bit is sent (10 bits IIRC?) on the debug IO pin from the device

20:00 <davidc__> IIRC, one of the bits also signals an "interrupt" from the device to host; and the remaining 8 bits are used bidirectionally to convey higher protocol layer bytes in each direction

20:00 <whitequark> daveshah: that... that reminds me acutely of PS/2

20:00 <whitequark> it's almost identical in many aspects

20:00 <whitequark> davidc__: * sorry

20:04 <davidc__> For bonus points, the debug clock changes wildly during device boot, since it's derived directly from the core clock

20:04 <davidc__> so as you change the core PLL / etc, the debug clock changes with it

20:04 <davidc__> (including bonus glitches during clock change!)

20:05 <sorear> Does that require a wire short enough that propagation delays are ignorable?

20:05 <davidc__> Probably. The only debugger for this monstrosity is the vendor's debug tools; and it has a captive cable

20:05 <davidc__> (might explain why it has a captive cable!)

20:07 <whitequark> ha

20:09 <davidc__> vendor's tool was $2k or so.... supported a total of 3 parts (very very niche but high volume part)

20:09 <davidc__> Internally, it was just a cypress USB bridge + FPGA (similar to glasgow) but much older parts

20:23 <implr> that's a quite popular combo, xilinx's platform cable is like that too

20:24 <whitequark> fx2 is a super common thing to put together to an fpga

20:25 <daveshah> Saw one video capture card which was PCIe -> USB3.0 bridge -> FX3 -> FPGA

20:26 <whitequark> that's pretty cursed actually.

20:26 <whitequark> so one day i wanna make you know what? boneless fx

20:26 <whitequark> like fx2 but with boneless instead of 8051 and all on an fpga

20:27 <emeb> sure. eliminate parts.

20:28 <whitequark> no, it will only work at full speed

20:28 <whitequark> because HS USB is cursed

20:28 <whitequark> i guess with a PHY it could work at HS.

20:28 <emeb> eliminating 8051s is always a net positive

20:28 <whitequark> true

20:28 <whitequark> i might also implement a formally verified 8051

20:28 <emeb> but yeah - you'd have to design a fully FPGA-able HS PHY.

20:28 <daveshah> USB3300s are super cheap

20:28 <whitequark> i *think* it could be completely microcoded in one 256x16 BRAM

20:29 <davidc__> whitequark: that requires something to verify against

20:29 <daveshah> If you don't care about adding a few BOM lines

20:29 <whitequark> daveshah: and the spec, of course

20:29 <whitequark> arrr

20:29 <whitequark> davidc__: ^

20:30 <davidc__> I think the biggest downside of eliminating the USB controller/processor is how one handles reconfiguration / bitstream testing

20:30 <davidc__> I wish proper partial reconfiguration was a thing

20:31 <whitequark> yes

20:31 <whitequark> glasgow will always use fx*

20:31 <whitequark> because they're extremely foolproof

20:31 <davidc__> something something greater fool

20:31 <whitequark> i didn't say completely

20:32 <emeb> 500yrs from now 8051s will still exist. buried in essential infrastructure.

20:32 <whitequark> you *can* soft-brick glasgow revC

20:32 <whitequark> (you need to short 2 pins to fix that)

20:32 kuldeep has quit [Remote host closed the connection]

20:32 <whitequark> you *can* hard-brick glasgow revC by electrically destroying it

20:32 <whitequark> or mechanically

20:32 <daveshah> Even with partial reconfig you'll still want to update your wrapper/"bootloader" module at some point

20:32 <whitequark> but not short of that

20:32 <daveshah> Which is where something like an FX2 is very nice (and where countless tinyfpgas have been lost)

20:33 <TD-Linux> daveshah, you could always just use two fpgas too. not worse than fpga + fx2

20:33 <TD-Linux> tinyfpgas don't do partial reconfig though. what kills them?

20:33 <daveshah> I think USB3300+ECP5-12k is similar component price wise to fx2 (but higher assembly cost from more parts)

20:33 <whitequark> daveshah: in fairness tinyfpgas have a really stupid bootloader and i don't know why

20:33 <daveshah> The bootloader update going wrong

20:33 <whitequark> it's not hard to validate commands

20:33 <daveshah> This isn't even to do with command validation

20:33 <whitequark> ah

20:34 <daveshah> This is to do with the programmer dying in the middle of updating the bootloader

20:34 <whitequark> oh it doesn't do atomic updates

20:34 <daveshah> e.g. due to modemmanager, random command validation, etc

20:34 <davidc__> daveshah: I guess there's restrictions on doing an A/B scheme

20:34 <daveshah> *random command errors

20:34 <TD-Linux> yeah you can't really do A/B because the fpga reads from the same start address always

20:34 <daveshah> It could easily copy the image into spare flash, write an updater into the user app area, and warmboot into that

20:35 <whitequark> ^

20:35 <TD-Linux> ah true

20:35 <daveshah> Then the only risk is a power failure or FPGA upset, as opposed to any kind of issue with the Linux USB-serial infrastructure or fragile Python atop it

20:35 <davidc__> TD-Linux: depends on the FPGA. Some will search for a magic, and a magic at offsets

20:35 <daveshah> ECP5 has a golden image feature like that

20:35 <TD-Linux> you could also just never update the first stage bootloader

20:36 <davidc__> If you write the magic after verifying the rest of the image, it's hard to brick
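
The "write the magic last" idea can be sketched against a simulated flash. Everything here is an assumption for illustration: the `MAGIC` value, the slot layout (magic followed by image body), and the helper names `write_image` and `find_bootable` are hypothetical, and a `bytearray` stands in for the SPI flash. It is not how any particular FPGA or the TinyFPGA bootloader works.

```python
import zlib

MAGIC = b"\xA5\x5A\xC3\x3C"   # hypothetical marker the loader searches for
ERASED = 0xFF                 # erased flash reads back as 0xFF


def write_image(flash, base, image, fail_after=None):
    """Write the image body, verify it, and only then write the magic.

    fail_after simulates the programmer dying after that many bytes.
    Returns True if the slot ended up bootable.
    """
    body_at = base + len(MAGIC)
    # Erase the whole slot first, so a stale magic can never survive.
    flash[base:body_at + len(image)] = bytes([ERASED]) * (len(MAGIC) + len(image))
    for i, byte in enumerate(image):
        if fail_after is not None and i >= fail_after:
            return False              # interrupted: magic was never written
        flash[body_at + i] = byte
    # Read back and verify before committing.
    written = bytes(flash[body_at:body_at + len(image)])
    if zlib.crc32(written) != zlib.crc32(image):
        return False
    flash[base:body_at] = MAGIC       # commit point
    return True


def find_bootable(flash, offsets):
    """Return the first offset that carries the magic, or None."""
    for off in offsets:
        if bytes(flash[off:off + len(MAGIC)]) == MAGIC:
            return off
    return None


flash = bytearray([ERASED] * 256)
write_image(flash, 0, b"golden image")                # fallback slot at 0
write_image(flash, 64, b"new image", fail_after=3)    # update dies midway
print(find_bootable(flash, [64, 0]))                  # falls back to slot 0
write_image(flash, 64, b"new image")                  # retry succeeds
print(find_bootable(flash, [64, 0]))                  # now boots slot 64
```

An interrupted update leaves the slot without a magic, so the search skips it and lands on the older image.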

20:36 <whitequark> yeah, write it in the flash and protect it

20:36 <whitequark> just make that region OTP ROM

20:36 <whitequark> atmel dataflash can do that

20:39 <azonenberg> davidc__: this is what i like about xilinx parts

20:39 <azonenberg> they have pretty good support for fallback boot

20:39 <azonenberg> i havent had to use it yet (none of my boards have been field updateable)

20:39 <whitequark> ecp5 has extensive support for that

20:39 <azonenberg> but good to have

20:45 <davidc__> I'm a big fan of "can flash entire board through single cable"

20:46 emeb_mac has joined ##openfpga

20:48 <azonenberg> davidc__: well most of my boards these days are just an fpga with no other programmable stuff :p

20:49 <azonenberg> i cant recall the last time i used a MCU in a design

20:49 <azonenberg> integralstick has one but i've never used it yet

20:49 <TD-Linux> still haven't figured out ideal ice40 end user flashing solution. a more robust version of tinyfpga programmer would be great but still too big for hx1k

20:49 <TD-Linux> ffp is my favorite so far

20:50 <TD-Linux> but still requires a separate flashing step

20:50 <davidc__> azonenberg: I typically use FTDI parts as data pump + flasher. They have their challenges, but once you can make them work, they work.

20:51 <azonenberg> davidc__: most of the time i use raw jtag for flashing and ethernet for communication

20:51 <azonenberg> i want to write a tftp bootloader at some point

20:51 <azonenberg> i'm trying hard to move away from USB, including but not limited to ftdi

20:52 <davidc__> azonenberg: I've played with some designs for an ethernet-capable flash+JTAG debugger

20:52 <davidc__> azonenberg: looked at some PHYs that have NC-SI or other sideband

20:53 <davidc__> azonenberg: I'd love to come up with some BOM cost under $10 design, and make a pile of castellated modules

20:54 <davidc__> then just solder them down in designs + get JTAG/SPI muxed into an existing 1G ethernet port

20:57 <azonenberg> ah yeah that is very different, i was talking about actually having application layer firmware updating using the existing udp/ip soft logic on the fpga

20:58 <azonenberg> as far as jtag over ethernet for prototypes, my plan is starshipraider

20:58 <azonenberg> i actually just dusted off (literally) the prototype

20:58 <azonenberg> gearing up to do more work on it as i slowly unpack the lab

20:58 <azonenberg> i passed electrical inspection this morning, city final tomorrow

20:58 <azonenberg> So we're moving in momentarily

20:59 <azonenberg> Still gonna be a bit of a mess for a few months because i just burned my month's spare cash on a 6 kVA Eaton UPS that hasn't come in yet

20:59 <azonenberg> once my wallet recovers i'm buying some cabinets and shelving

21:03 <whitequark> TD-Linux: i think i can fix USB+boneless into hx1k

21:03 <whitequark> fit*

21:03 <whitequark> i might implement it later

21:03 <whitequark> davidc__: "I'd love to come up with some BOM cost under $10 design, and make a pile of castellated modules" this was the original design for glasgow :D

21:04 <whitequark> design intent*

21:04 <TD-Linux> whitequark, that would be pretty cool. hx1k allows the very easy tqfp package that works on 2 layer boards

21:04 <whitequark> alright

21:04 <whitequark> btw how is that LPC-ISA board going?

21:05 <TD-Linux> I haven't done much on it yet, I'm on "vacation" on the farm right now. (which actually has 100/100 fiber, I have no excuse)

21:05 <whitequark> ah heh

21:06 <TD-Linux> the fpga on there needs to be flashable, which is why I was thinking about this :)

21:06 <whitequark> TD-Linux: oh that's trivial

21:06 <TD-Linux> I'm probably going to use either the tqfp100 or tqfp144 package (hx4k) on it. without pci it's a lot easier

21:06 <whitequark> just wire the LPC pins to the configuration bank

21:07 <whitequark> and a sideband to Glasgow or a jumper, depending on pin restrictions

21:07 <whitequark> i think only a jumper will work with one IDC

21:07 <TD-Linux> sure, that would work.

21:08 emeb_mac has quit [Ping timeout: 244 seconds]

21:20 <davidc__> whitequark: heh. At some point I want to make a derivative of glasgow that is castellated (basically, drop the level shifters)

21:21 <davidc__> whitequark: For solder-in use on prototypes

21:21 <whitequark> davidc__: with the fx2 and everything?

21:21 <davidc__> whitequark: yeah, FX2 + FPGA... though I might look into whether different packages are available

21:21 <whitequark> also, i assume you will drop the bitstream EEPROM

21:21 <whitequark> but keep the FX2 one?

21:22 <whitequark> what about programmable pulls?

21:22 <whitequark> i would be very worried about supporting a glasgow derivative that doesn't have programmable pulls

21:22 <whitequark> but e.g. one that has fewer ports or no programmable Vregs seems basically fine

21:23 <davidc__> Oh, I'd have absolutely no expectations for upstream support

21:23 <whitequark> i think if you keep the ADC and pulls i could support it upstream.

21:23 <davidc__> If upstream breaks, that's my problem, since the board will probably only exist in my shop :P

21:23 <whitequark> if you don't you'd be basically making a revB and many applets (like i2c) barf on revB

21:25 <davidc__> current plan is to drop the ADC; and just have a VIO provided by the host board since the IO voltage will already be around

21:25 <whitequark> right, you'd obviously drop the DAC and LDO

21:26 <davidc__> er, duh, DAC

21:26 <davidc__> Sorry :)

21:26 <whitequark> but the ADC is kind of important to the flow

21:26 <davidc__> whitequark: I'll get back to you once I have a more defined plan :)

21:26 <gruetzkopf> i have previously slapped my revb into my products backplane with a terrible perfboard adapter, can confirm usefulness

21:27 <whitequark> davidc__: with the ADC and pulls i'd be perfectly happy to have the board and code upstream

21:27 <whitequark> this is certainly something people have asked for

21:27 <gruetzkopf> (though that was via the headers - and i'm dealing with big stuff that fills a 3U eurocard carrier or 200)

21:29 <mwk> whitequark: so I would like to write up some of that stuff about JTAG and also about the packet processor

21:29 <mwk> is there some obvious place I can put it?

21:29 <whitequark> mwk: yes. let's rename glasgow.arch.xilinx.xc6s to glasgow.arch.xilinx.fpga

21:29 <whitequark> and put it there

21:30 <mwk> what? in the big comment on top?

21:30 <whitequark> yep

21:30 <whitequark> i plan to convert these to docstrings at some point

21:30 <whitequark> but a *lot* of applet and arch files have huge comments explaining things

21:30 <whitequark> and i refer people to these a lot

21:30 <whitequark> check out the one for floppies :D

21:31 <mwk> oh yes, that one was quite epic

21:31 <whitequark> you can also drop every document you directly reference to docs/archive/

21:31 <whitequark> i try to do that

21:31 <whitequark> although it sounds like most of your work is blackbox RE

21:32 <mwk> pretty much yes

21:32 <whitequark> i should certainly make sure you get a glasgow, at some point :p

21:32 <azonenberg> gruetzkopf: starshipraider is descended from a management card that i was going to put in my MARBLEWALRUS fpga cluster

21:32 <azonenberg> Ethernet to nine lanes of jtag, nine lanes of uart, nine lanes of i2c, and a bunch of gpios and other stuff

21:33 <whitequark> no kill like overkill

21:33 <azonenberg> for one side of a 3U eurocard chassis (two ten-card backplanes plus a PSU in 3U)

21:35 <gruetzkopf> yeah i'm running a 25-slot (one of them controller) plane plus PSU

21:35 <mwk> hmmm

21:35 <mwk> I basically have all the xilinx user guides / config guides / whatever guides mirrored already

21:36 <mwk> 0.5GB of crap

21:36 <whitequark> i have a lot of them in my library

21:36 <whitequark> check out Electronics/Programmable Logic/Xilinx

21:36 <whitequark> with nicer names so you can use desktop file search like spotlight

21:37 <mwk> I um

21:37 <mwk> I get an internal server error when I try to log in

21:37 <mwk> Remote Address: ::ffff:84.10.63.242

21:37 <whitequark> uhh. try again

21:37 <mwk> Request ID: GBtfBlVQ7g1eqYFmaqgE

21:37 <whitequark> it does that sometimes intermittently

21:37 <whitequark> hateful heap of php crap

21:38 <mwk> oh yes, worked now

21:38 <whitequark> the actual docs can be synced (partially too) with a desktop client or via any webdav client

21:39 <gruetzkopf> my next thing to poke at is this PABX which implements the TDM switch in a XC2V2000

21:40 <mwk> nice

21:40 <mwk> so, should I just dump my collection there?

21:41 <whitequark> if you can make it fit the existing structure you can dump it right into the main hierarchy yes

21:41 <whitequark> if you don't want to bother dump it to Incoming/ which is world writable

21:41 <mwk> easy

21:41 <whitequark> for the former i'd need to give you a write bit

21:41 <whitequark> ok give me a sec

21:42 <whitequark> feel free to dump any other docs into it as long as they're sorted and renamed

21:42 <whitequark> the scope is "all documentation"

21:42 <whitequark> mwk: you have the write bit

21:52 <azonenberg> mwk: let me see how big my datasheet archive is...

21:52 balrog has quit [Quit: Bye]

21:53 <azonenberg> azonenberg@havequick:/nfs4/share/datasheets$ du -h --summarize .

21:53 <azonenberg> 3.6G .

21:53 <azonenberg> but that isnt all xilinx

21:53 <azonenberg> only 612 MB of xilinx stuff

21:55 <whitequark> ~/Library$ du -hs .

21:55 <whitequark> 37G .

21:56 <azonenberg> whitequark: is that just datasheets though?

21:56 <azonenberg> or journal articles, books, etc

21:57 balrog has joined ##openfpga

21:58 <whitequark> azonenberg: a lot of it is the entire set of DIN standards

21:59 <whitequark> hm, a FIB manual

21:59 <whitequark> a directory called "missile textbooks"

22:00 <azonenberg> lol

22:00 <mwk> hilarious software

22:00 <whitequark> a huuuuge collection of Intel manuals with information Intel scrubbed from it

22:00 <whitequark> the VISA stuff

22:00 <mwk> if I'm renaming a file and currently entering the file name, and the browser window loses focus because I move mouse to another display, it performs the rename immediately

22:01 <whitequark> mwk: that's common to desktop software. but yeah ill-advised in browser

22:01 <whitequark> it's also very very slow

22:01 <mwk> whitequark: so the format is "DS420 <title straight from the document> v6.9.pdf"?

22:01 <whitequark> the server is in uhhhh singapore for uhhh reasons

22:01 <whitequark> mwk: exactly
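
The naming scheme agreed on above ("<document number> <title> v<version>.pdf") is simple enough to build and parse mechanically. A minimal sketch, assuming the document number contains no spaces and the version is dotted digits; the helper names `library_name` and `parse_library_name` are invented for this example and are not part of any existing tool.

```python
import re

# <doc number> <title> v<version>.pdf; the title may itself contain spaces.
_NAME_RE = re.compile(
    r"^(?P<doc_id>\S+) (?P<title>.+) v(?P<version>\d+(?:\.\d+)*)\.pdf$"
)


def library_name(doc_id, title, version):
    """Build a file name in the agreed format."""
    return f"{doc_id} {title} v{version}.pdf"


def parse_library_name(name):
    """Split a file name back into its parts, or return None if it does not fit."""
    match = _NAME_RE.match(name)
    return match.groupdict() if match else None


name = library_name("DS420", "Example Title", "6.9")
print(name)
print(parse_library_name(name))
print(parse_library_name("random_scan.pdf"))
```

Files that do not parse are candidates for Incoming/ rather than the sorted hierarchy.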

22:02 <mwk> alright, let's do that

22:02 <whitequark> just enough to make it easily searchable with spotlight if content indexing is turned off

22:02 <whitequark> (or the kde spotlight-like thing)

22:03 <whitequark> because while it *does* index 37 GB of mostly PDFs that takes a while

22:03 <azonenberg> i dont have any of my stuff indexed

22:03 <azonenberg> i just have /nfs4/share/datasheets/Vendor[/family if lots of stuff]/File.pdf

22:03 <whitequark> azonenberg: i can do super+r ug380 enter and it gives me the document

22:04 <whitequark> or super+r ieee 1352

22:04 <whitequark> all the world's standards at my fingertips :p

22:04 <whitequark> well not all. not *yet*

22:04 <whitequark> i should scrape all of ieee sometime

22:06 <gruetzkopf> whitequark: my (very incomplete) DIN set is 100G

22:06 <whitequark> gruetzkopf: hm

22:06 <whitequark> did the DIN upload not finish?

22:07 <whitequark> 11981 files

22:07 <whitequark> 9.6 GB

22:08 <gruetzkopf> my DIN - no - other - orgs involved set is 23.5kFiles alone

22:09 <whitequark> ah crap

22:10 <whitequark> wanna complete my set?

22:10 <gruetzkopf> mine's incomplete too, but apparently bigger now. i actually have non-terrible internet now too!

22:11 <mwk> whitequark: do you want Xilinx PROMs, SystemACE, etc. in the same directory, or is there another place for them?

22:11 <whitequark> mwk: it's related to programmable logic so it's fine

22:12 <mwk> (of course you're going to get the SystemACE™ datasheet from me)

22:12 <whitequark> it's also ok to put the same file in multiple places

22:12 <whitequark> i don't *think* it deduplicates things but that's temporary

22:12 <Ultrasauce> find something less janky than nextcloud?

22:12 <whitequark> Ultrasauce: it's on the todo list

22:12 <whitequark> find or write

22:12 <Ultrasauce> ah yes

22:12 <Ultrasauce> another project

22:12 <whitequark> because i don't think anyone implemented a system with quite the featureset i want

22:13 <whitequark> trustless distributed encrypted storage

22:13 <whitequark> tahoe-lafs but like, with write speeds over 100 kb/s

22:13 <whitequark> because i want to be able to upload ~100 TB into it and have it work

22:17 <gruetzkopf> whitequark: i would but the seafile2seafile link fell over

22:17 <gruetzkopf> *nextcloud2nextcloud

22:20 dh73 has quit [Remote host closed the connection]

22:20 <whitequark> gruetzkopf: aw, crap

22:20 <whitequark> is it debuggable

22:21 <gruetzkopf> Have spent zero time on trying

22:22 <gruetzkopf> Should just grab a normal acct

22:22 <whitequark> understandable

22:27 Bike has joined ##openfpga

22:37 <mwk> whitequark: it appears your "Series 7" directory contains, in fact, mostly UltraTrash documentation

22:37 <mwk> I'll fix it...

22:41 <TD-Linux> UltraLimescale

22:49 <mwk> hrm, it appears that old libraries guides don't have a UGxxx number

22:50 <mwk> how annoying

23:27 <whitequark> mwk: hehe, another person who calls it UltraTrash

23:28 <kc8apf> Where do I get access to this library?

23:29 <mwk> if I have a different document version than you, I'm supposed to upload it and we keep both, right?

23:30 <kc8apf> Unrelated: new Spectre variant came out of embargo today

23:35 <mwk> windows-only hardware bug?

23:35 <mwk> what the fuck?

23:47 emeb has quit [Quit: Leaving.]

23:50 emeb_mac has joined ##openfpga