##openfpga on 2019-11-25 — irc logs at freenode.irclog.whitequark.org

00:00 <azonenberg> my understanding is that the problem is when you have clock-to-out less than the hold time

00:00 <azonenberg> presumably at the extremes of PTV

00:00 <daveshah> The problem is to do with hold time plus clock skew

00:00 <mwk> azonenberg: also when you have shit time skew

00:00 <daveshah> Without clock skew, Xilinx would be fine

00:00 <mwk> er, clock skew

00:01 <azonenberg> oh if you have clock skew that makes it worse

00:01 <daveshah> But afaik at certain clock region boundaries there is skew

00:01 <daveshah> Yeah

00:01 <mwk> consider the clock region boundary

00:01 <daveshah> Both Xilinx and Intel do take the approach that their tools are good enough they can take more liberties in hardware

00:02 <mwk> Y0 vs Y1

00:02 <daveshah> Lattice don't trust their tooling in the same way

00:02 <whitequark> so in principle, if (considering the verilog i posted above) if you have BUFG->clka->BUFGCE->clkb, and the delay of BUFGCE plus clock skew is less than hold time,

00:02 <mwk> clock regions are 50 CLBs high

00:02 <whitequark> then there would be no race?

00:02 <whitequark> (can you even connect a BUFGCE to the output of BUFG?)

00:03 <mwk> two immediately neighbouring FFs straddling the boundary have the same path length and delay time between their vertical spine tap and the FF clock input

00:03 <mwk> but the vertical spine taps are 50 CLB heights apart

00:03 <mwk> so if they are not the middle two rows, you have 50 CLBs worth of clock skew between them

00:04 <mwk> whitequark: connecting BUFG output to BUFGCE input is a bad idea

00:04 <mwk> what you want is to have src->BUFG->clka and src->BUFGCE->clkb

00:04 <whitequark> yes, I know I wouldn't want to do this in a real design

00:04 <whitequark> I'm not making an FPGA bitstream, I'm writing a simulator

00:04 <whitequark> so I specifically look at bad ideas

00:05 <mwk> some FPGAs have dedicated paths between BUFG outputs and BUFG inputs

00:05 <mwk> some virtex 4/5/6 definitely does, spartan 3 definitely does not

00:05 <whitequark> aha

00:05 <mwk> let's look it up

00:06 <whitequark> oh another question re hold time. I've heard that DFFs can be implemented as two latches with different active polarity in series

00:06 <mwk> yeah, series 7 BUFG outputs can be connected to BUFG inputs via dedicated paths

00:06 <whitequark> doesn't that basically shift D by 180°?

00:06 <daveshah> whitequark: I think this is even the most common way these days

00:06 <mwk> yes, and that's The Standard Way to do a flop

00:06 <whitequark> so your hold time is at least a half period

00:06 <mwk> no

00:07 <whitequark> hm

00:07 <mwk> hold on, I had a nice demo I showed to my students

00:07 <whitequark> where am I mistaken?

00:07 <mwk> whitequark: only one latch is ever enabled

00:08 <whitequark> sure

00:10 <whitequark> if it's posedge triggered, then the first half cycle, first latch is transparent, and second latch is holding the previous value

00:10 <mwk> yes

00:10 <whitequark> the second half cycle, first latch is holding the value, and second latch is transparent

00:12 <whitequark> assuming the delay of the latches themselves is negligible, this means that Q is exactly D shifted 180°

00:12 <whitequark> no?

00:14 <whitequark> if the delay isn't negligible then it's shifted 180+n°, and unstable for 360-n°

00:14 <whitequark> something like that anyway

00:15 <whitequark> er, unstable for n°

00:16 <mwk> YES, found it

00:16 <mwk> http://www.play-hookey.com/digital/sequential/d_nand_flip-flop.html

00:17 <mwk> that was the thing that actually helped me understand how that thing works

00:17 <mwk> beware: it's a *negedge* flop

00:18 <whitequark> hm

00:18 <mwk> (click on the inputs on the left to have fun)

00:18 <whitequark> oh it has javascript

00:19 <mwk> so when clock is 0, the left latch is holding state, the right latch is transparent

00:19 <mwk> and since left latch cannot change, right latch is effectively frozen as well

00:20 <whitequark> yep

00:20 <mwk> when 0-to-1 transition happens, left latch suddenly opens (to transparent), and right latch suddenly closes (to hold)

00:21 <whitequark> yep

00:21 <mwk> since this happens at the same time, and they both have output delay, the right latch will always end up holding whatever left latch held before

00:21 <mwk> when clock is 1, the left latch is transparent and the middle data line keeps following input; the right latch is holding

00:22 <mwk> and on the active 1-to-0 transition, the right latch opens and gets whatever value is currently on the middle data line (which was connected directly to input until now, only a single NAND of delay)

00:22 <mwk> at the exact same time, left latch closes and freezes its value

00:23 <whitequark> yep. i understand that

00:23 <mwk> and this is how the whole thing manages to have only 1-2 NAND delay worth of setup/hold time

00:24 <whitequark> wait

00:24 <whitequark> i know what went wrong

00:24 <whitequark> i used "hold time" incorrectly

00:25 <mwk> and about delaying stuff by 180°

00:25 <mwk> suppose you want to chain two DFFs with opposite clock polarity

00:25 <mwk> (which is a thing you want to do in DDR I/Os)

00:26 <mwk> if you draw the two FFs next to each other, you'll notice that the middle two latches are exactly identical and redundant

00:26 <mwk> so the circuit can actually be optimized to three latches :)

00:28 <mwk> and this is exactly what they actually do in DDR I/O logic

00:29 <mwk> you have three latches in a row; if you select SAME_EDGE ddr, you use all of them; if you select plain FF, you use two of them and bypass the third; and if you select a latch, you bypass all but one

00:29 <mwk> quite elegant

00:32 <whitequark> yes, I was wondering about the redundant middle latch

00:32 <whitequark> neat

00:34 <whitequark> also I'm not sure how it's called but when I said "hold time" earlier I meant "period minus propagation delay time" and it's not half period but rather almost the entire period

00:34 <whitequark> it was a fairly dumb question

00:34 <whitequark> because that's just how a DFF works

00:34 <whitequark> but I understand it better now, anyway

00:35 <whitequark> thank you :)

00:36 <mwk> yeah, I'm really glad I found that demo thing :)

00:36 <mwk> worked wonders for my class as well

00:36 <whitequark> it turns out that I understood the basic structure just fine without the demo, actually

00:37 <whitequark> but I never really thought about the timing implications of it properly before

00:37 <mwk> ah, fair enough

00:37 <whitequark> although it's definitely useful to see exactly how it's implemented on gate level, too

00:54 <whitequark> hm

00:54 <whitequark> the SB_IO circuit in the datasheet shows two FFs and a mux selected by the clock

00:54 <whitequark> for the DDR output

00:54 <whitequark> I guess that's a lie then

00:55 <mwk> does ice40 have something like SAME_EDGE mode?

00:56 <mwk> ie. for the output register, the data for both phases is actually sampled at the same edge of the clock

00:56 <mwk> to make interfacing from the fabric easier

00:56 <whitequark> no, you have to do that yourself

00:56 <whitequark> (nmigen inserts this FF :)

00:56 <mwk> ah, then they don't use that trick

00:56 <whitequark> ohh

01:06 <mwk> also tbh it's not really that awesome

01:06 <mwk> given the area proportion between a single latch and a big honking I/O pad

01:07 <mwk> it'd be really hard to notice it at all

01:08 <whitequark> well, it's cute

01:08 <mwk> agreed

01:46 <cr1901_modern> >(nmigen inserts this FF :)

01:46 <cr1901_modern> To simulate SAME_EDGE mode?

01:47 <mwk> of course, what else

01:47 <mwk> you want these two bits in the same clock domain to do anything with them

01:47 <whitequark> yep

01:48 <whitequark> i mean, nextpnr actually deals with it just fine, but your Fmax drops in hal

01:48 <whitequark> *half

01:48 <whitequark> so you probably don't want that :p

01:48 <cr1901_modern> >of course, what else

01:48 <cr1901_modern> Sorry, long day lol

03:14 pie__ has quit [Ping timeout: 240 seconds]

03:25 Bike has quit [Quit: Lost terminal]

03:57 nrossi has joined ##openfpga

04:34 rohitksingh has quit [Ping timeout: 245 seconds]

04:34 ZombieChicken has quit [Ping timeout: 240 seconds]

04:44 <TD-Linux> rip mips. not a single tear was shed https://www.hackster.io/news/wave-computing-closes-its-mips-open-initiative-with-immediate-effect-zero-warning-e88b0df9acd0

06:00 ZombieChicken has joined ##openfpga

06:04 rohitksingh has joined ##openfpga

06:06 Jybz has joined ##openfpga

06:24 ZombieChicken has quit [Ping timeout: 240 seconds]

06:29 Jybz has quit [Quit: Konversation terminated!]

07:24 freemint has joined ##openfpga

07:38 ZombieChicken has joined ##openfpga

07:38 awordnot has quit [Ping timeout: 276 seconds]

07:40 freemint has quit [Remote host closed the connection]

07:40 freemint has joined ##openfpga

07:42 freemint has quit [Remote host closed the connection]

08:06 awordnot has joined ##openfpga

08:34 keesj has joined ##openfpga

09:26 _whitelogger has joined ##openfpga

10:26 <OmniMancer> daveshah: where in the database do you note down the relation between IO sites and package pins?

10:26 <daveshah> OmniMancer: https://github.com/SymbiFlow/prjtrellis-db/blob/master/ECP5/LFE5U-25F/iodb.json

10:27 <OmniMancer> cool, are those collected via fuzzing?

10:28 <daveshah> Nope, by parsing the Lattice CSV files

10:28 <daveshah> They use names like PT42A that directly correspond to site locations

10:28 <daveshah> (i.e. top row, col 42, PIO A)

10:28 pie_ has joined ##openfpga

10:31 <OmniMancer> ah

11:18 mifune has quit [Ping timeout: 240 seconds]

11:18 mifune has joined ##openfpga

11:40 m4ssi has joined ##openfpga

11:44 massi_ has joined ##openfpga

11:44 BusterTheDummy has joined ##openfpga

11:46 keesj_ has joined ##openfpga

11:48 m4ssi has quit [Excess Flood]

11:48 IanMalcolm has quit [Remote host closed the connection]

11:48 keesj has quit [Ping timeout: 240 seconds]

11:57 BusterTheDummy has quit [Quit: ZNC 1.7.5 - https://znc.in]

11:57 IanMalcolm has joined ##openfpga

12:05 <OmniMancer> Hmmm, I am not sure if all the tiles are real or which are abstractions

12:22 <OmniMancer> daveshah: in the nextpnr generic backend, what does the "location" of a wire represent?

12:23 <daveshah> OmniMancer: it's just a nominal point used for delay estimates

12:23 <daveshah> Either source location or some kind of midpoint would work

12:25 <OmniMancer> and "pip" locations are then?

12:25 <OmniMancer> the location of the sink?

12:25 <daveshah> The location of the switch

12:25 <daveshah> Usually where the bitstream bits are for non pseudo pips

12:27 <OmniMancer> so the location of the sink end of the wire is fine then

12:28 Asu has joined ##openfpga

12:29 <OmniMancer> it seems each input can be explicitly tied to ground, but how does one determine what state an input is in when no mux setting is applied?

12:33 <OmniMancer> daveshah: do tile location suffice for the delay estimate?

12:33 <daveshah> Yes

12:33 <daveshah> Tile location is what ecp5 and ice40 use

12:33 <daveshah> Some experimentation might be needed for no mux setting values

12:34 <daveshah> Seeing what the tool does in certain cases

12:36 <OmniMancer> well AFAIK the tools default state is to set no bits

12:36 <OmniMancer> so I sort of expect that an all 0s bitfile will do nothing

12:38 <OmniMancer> I suppose LUT/FF inputs can be inferred by constructing a design that will give one or the other result based on the unconnected input state

12:44 <OmniMancer> Hmmm the PLLs in this part can apparently be dynamically configured

12:51 <daveshah> I would be careful with experimental results, a floating signal could end up either way depending on circumstances so a single experiment won't be perfect

12:52 <daveshah> Stuff usually floats high, although notably ECP5 still has explicit connections to 1 too

12:52 <daveshah> Xilinx otoh a connection to 1 doesn't set any bits

12:53 <sorear> does a single floating signal affect static power enough to measure?

12:53 <OmniMancer> No idea yet

12:53 <OmniMancer> I would have to set up a test

12:53 <OmniMancer> Or ask someone else to

12:59 X-Scale` has joined ##openfpga

13:01 X-Scale has quit [Ping timeout: 240 seconds]

13:01 X-Scale` is now known as X-Scale

13:08 <OmniMancer> Hmm it seems each tile only has 2 possible global clock inputs

13:14 <OmniMancer> but those global clock wires can be routed into the fabric

13:22 X-Scale has quit [Ping timeout: 265 seconds]

13:22 X-Scale has joined ##openfpga

13:51 X-Scale` has joined ##openfpga

13:52 X-Scale has quit [Ping timeout: 250 seconds]

13:53 X-Scale` is now known as X-Scale

14:00 <OmniMancer> hmmm, it appears you only get to pick a clock for the mslices and a clock for the lslices in a tile

14:19 <OmniMancer> I suspect the local wires are used to bridge the gaps in which interconnect wires can connect to which inputs

14:26 X-Scale` has joined ##openfpga

14:27 X-Scale has quit [Ping timeout: 240 seconds]

14:28 X-Scale` is now known as X-Scale

14:48 OmniMancer has quit [Quit: Leaving.]

15:25 freemint has joined ##openfpga

15:38 dh73 has joined ##openfpga

16:24 genii has joined ##openfpga

16:36 <tnt> Does anyone know if the sd card 4 bit mode is documented somewhre publically ?

16:36 <tnt> I mostly just find the spi mode documented.

16:37 massi_ has quit [Remote host closed the connection]

16:45 <miek> there are some details in the SDIO simplified specification, it doesn't look like the full spec is public

16:59 <tnt> yeah, I was hoping it leaked somewhere in all this time :p

17:00 <tnt> miek: btw, unrelated but ... I have your SMA antenna still.

17:24 Finde has quit [Ping timeout: 245 seconds]

17:30 <kc8apf> tnt: I see 4-bit mode described in Section 3.6.1 of SD Specifications Part 1 Physical Layer Simplified Specification

17:31 <kc8apf> unless you mean UHS-II which is in a separate addendum

17:36 Finde has joined ##openfpga

17:38 <gruetzkopf> is the sd express stuff publically described?

17:42 freemint has quit [Quit: Leaving]

17:53 <miek> tnt: oh yeah, i'll get it at congress :)

17:56 <kc8apf> wtf. sd express repurposes the UHS-II pins for PCIe lanes and steals a few pins from UHS-I for REFCLK, PERRST#, and CLKREQ#.

17:57 <kc8apf> their whitepaper claims that as long as you connect the PCIe signals to the right pins, it will train and show up as a normal NVMe device

18:01 <gruetzkopf> oh, neat

18:04 <GenTooMan> question is that outside the specification for the bus? IE are they doing something that it wasn't designed for.

18:05 <kc8apf> doesn't look like it. They recommend talking over SDIO to determine card capabilities but they explicitly mention PCIe initialization is supported

18:05 <cr1901_modern> tnt: Simplified spec is enough to build a 4-bit mode core if you want something that "just works".

18:07 <GenTooMan> As long as they don't go outside the specification protocol wise or bus wise it's probably fine to do.

18:14 m4ssi has joined ##openfpga

18:14 m4ssi has quit [Remote host closed the connection]

18:19 <GenTooMan> daveshah thanks for the hint about nextpnr set_frequency in the pcf file change it now works works correctly.

19:02 dh73 has quit [Quit: Leaving.]

19:30 mumptai has joined ##openfpga

19:35 freemint has joined ##openfpga

20:10 dh73 has joined ##openfpga

20:26 marcan has quit [Remote host closed the connection]

20:27 marcan has joined ##openfpga

20:41 <azonenberg_work> pcie sd cards???

20:41 <azonenberg_work> the power density of that must be ridiculous

20:43 <kc8apf> they claim 1.8W max

20:43 <kc8apf> which is.....a lot

20:44 rohitksingh has quit [Ping timeout: 250 seconds]

20:46 <TD-Linux> slightly weirded out by a future where sd cards have dma access to your system

20:53 <ZirconiumX> Or where PCIe turns into the USB philosophy of "Everything over ~~USB~~ PCIe"

20:58 <sorear> when do we get the future where acs/ats are widely supported

21:15 freemint has quit [Ping timeout: 245 seconds]

21:15 nrossi has quit [Quit: Connection closed for inactivity]

21:16 mkru has joined ##openfpga

21:31 lopsided98 has quit [Remote host closed the connection]

21:32 <kc8apf> sorear: Microsoft's Secured Core program requires firmware to enable IOMMU with all devices restricted

21:32 lopsided98 has joined ##openfpga

21:37 finsternis has quit [Ping timeout: 252 seconds]

21:47 finsternis has joined ##openfpga

21:52 mkru has quit [Quit: Leaving]

22:15 freemint has joined ##openfpga

22:26 steve|m has quit [Quit: Lost terminal]

22:52 Bike has joined ##openfpga

22:58 Asu has quit [Quit: Konversation terminated!]

23:31 mumptai has quit [Quit: Verlassend]