Monday, November 20, 2023

Why raising tREFI or lowering tRFC has diminshing returns on performance

 I'm pretty sure I've mentioned this in a video or something but anyway: 

As tRFC ÷ tREFI approaches 0 further reductions in tRFC or increases in tREFI will impact performance less and less.
 
tREFI increasing:
800tRFC with 8 000 tREFI = 10% of clock cycles are spent on refresh 
800tRFC with 16 000 tREFI = 5% of clock cycles are spent on refresh 
800tRFC with 32 000 tREFI = 2.5% of clock cycles are spent on refresh
800tRFC with 64 000 tREFI = 1.25of clock cycles are spent on refresh
800 tRFC with 128 000 tREFI = 0.625% of clock cycles are spent on refresh
 
so going from 8K to 32K gives 7.5% more useful clock cycles >> big performance increase
going from 64K to 128K gives you an extra 0.625% >> basically no performance increase
 
tRFC decreasing:
800tRFC with 8 000 tREFI = 10% of clock cycles are spent on refresh 
400tRFC with 8 000 tREFI = 5% of clock cycles are spent on refresh 
200tRFC with 8 000 tREFI = 2.5% of clock cycles are spent on refresh 
100tRFC with 8 000 tREFI = 1.25% of clock cycles are spent on refresh 

incase you don't know what tRFC and tREFI are:
DDR memory needs to be "refreshed" in order to retain data because it stores data as charge on capacitors that slowly discharge over time.
tRFC is how long the memory controller waits after sending a REFRESH command before sending other commands (like ACTIVATE, READ, WRITE, PRECHARGE....)
tREFI is how often the memory controller sends REFRESH commands.

Tuesday, May 2, 2023

(WIP) The state of my motherboard collection/wishlist

Boards that are indented are boards I have and intend to keep. Boards that aren't indented are boards I don't have yet... Boards that I only have for review purposes are not on this list. WIP in the sense that this list doesn't include every board in my collection yet. My dream is to some day have test data for most if not ALL of these boards as well as PCB breakdowns.

INTEL

LGA1700
ASUS Z790 Apex
ASUS Z790i Strix
        EVGA Z790 DARK
EVGA Z690 DARK
        GIGABYTE Z790 Tachyon
        GIGABYTE Z690 Tachyon
        MSI Z790i EGDE
MSI Z690i Unify
        MSI Z690 Unify X
Asrock Z690 Aqua OC

LGA1200
ASUS Z590 Apex    
        ASUS Z490 Apex
        EVGA Z590 DARK
EVGA Z490 DARK
        Gigabyte Z590 Tachyon
MSI Z590 Unify X
        Asrock Z590 OC Formula

LGA1151v2
ASUS Z390 Apex
        ASUS Z390 Gene    // ASUS did eventually fix A2 DDR4 PCB compatibility
ASUS Z390 Extreme    // contestant for best T-top
Gigabyte Z390 Xtreme     // contestant for best T-top
        Gigabyte Z390 Master    // very solid T-top
        Gigabyte Z390i Pro    // tried to power bridge mod a CPU on this

MSI Z390i    // RAM OC
MSI Z390 Godlike    // should be a beast daisy-top
        EVGA Z390 DARK    // amazing as long as you use the right mem sticks
        Asrock Z390i Phantom Gaming    // good VRM otherwise kinda meh

LGA1151
        ASUS Z270 Apex    // had some issues last time I used it
ASUS Z170 Impact
Gigabyte Z270 Gaming SOC
        Gigabyte Z270i Gaming
MSI Z270 Xpower Titanium
Asrock Z170M OC Formula
        EVGA Z170 Stinger    // ITX with a POST CODE!

LGA1150
ASUS Z97 Impact
        ASUS Z87 Extreme
        ASUS Z87 Impact    // broken chipset / BIOS
Gigabyte Z97 SOC Force
        Gigabyte Z87 G1 Sniper 5
MSI Z97 Xpower
        MSI Z87 Xpower // BIG BOARD WITH BIG VRM
        Asrock Z97 OC Formula    // fixed bent pins
Asrock Z97M OC Formula ??
        Asrock Z87 OC Formula    // very broken
        Asrock Z87M OC Formula    // fixed bent pins

LGA1155
ASUS Z77 Extreme
ASUS Z77i Deluxe
        ASUS Z68 Extreme-Z
        Gigabyte Z77 UP7    // I have 2 of these. don't ask me why
Gigabyte Z68 UD7
        Gigabyte Z68 UD5    // huge VRM using cascaded doublers need I say more
        EVGA Z77 FTW    


LGA2066
ASUS Rampage VI Extreme Encore
        ASUS Rampage VI Apex // still haven't fixed SIO voltage reading issues
        EVGA X299 DARK    // I wish it had VTTDDR control
        EVGA X299 Micro 2    // mini-DARK modded for better RAM OC
        Asrock X299 OC Formula    // weak RAM OC on Sky-X. Good for Kaby-X on LN2

LGA2011v3
ASUS Rampage V Edition 10
        Gigabyte X99-SOC Champion // just an amazing board in general
MSI X99A XPower
        MSI X99A XPower Titanium    // great at ambient but has CB bellow -90C
Asrock X99M Killer USB3.1
Asrock X99 OC Formula
        EVGA X99 Classfied    // it was very cheap on Ebay IIRC

LGA2011
        ASUS Rampage IV Black Edition // fixed bent pins
        ASUS Rampage IV Extreme    // my first motherboard
        Gigabyte X79 UD7
        MSI X79 Big Bang II Xpower    // BIG wacky VRM
        Asrock X79 Extreme 7     // wacky RAM topology
        Asrock X79 Extreme 11    // unfortunately very broken

LGA1366
ASUS Rampage 3 Black Edition
        ASUS Rampage 3 Extreme
ASUS Rampage 2 Extreme
        Gigabyte X58A-OC     // I've got 2 of these :D one needed pin unbending
        Gigabyte X58A-UD9     // BIG WEIRD VRM
        Gigabyte X58A G1.Assasin    // BIG GREEN BOARD
        Gigabyte X58 Extreme    // broken pins causing 1 RAM channel to not work
MSI X58 Big Bang
        EVGA X58 Classified 4way
EVGA X58 Classified 3
EVGA X58 Classified
        EVGA SR-2
        Sapphire X58 Pure Black    // I AM A MASSIVE SAPPHIRE FAN
        Foxconn X58 Blood Rage    // had to E-power the QPI? VRM
Biostar X58 T-power    // probably sucks but looks intresting
DFI UT X58 T3eH8    // I want at least 1 DFI board but it can't be on socket I don't already have

 

AMD

AM5
ASUS X670E Gene
        Gigabyte B650E Tachyon
MSI X670E Carbon
ASrock X670E Taichi

AM4
ASUS X570 Extreme
ASUS X570 DARK Hero
ASUS X570 Impact
        ASUS X570i Strix    // modded for 1.8+ VDDR and better VTTDDR control
        ASUS X470 Hero    // meh at ambient. easy LN2 with 2700X
ASUS B550i Strix
        Gigabyte X570 Xtreme    // my current daily board
        Gigabyte X570i PRO ES    // fished out of the recycling bin at Gigabyte HQ
        Gigabyte X570S Master    // great RAM OC. Awful Vcore regulation
        Gigabyte B550 Vision D-P    // SOC VRM exploded once. I repaired it
        Gigabyte X470 Gaming 7    // haven't repaired yet / best T-top?
        Gigabyte X370 Gaming K7    // best T-top?
        Gigabyte X370 Gaming K5    // oof VRM great T-top
        MSI X570 Godlike    // the only "good" thing about this is the PCI-e layout
    
    MSI X570 Unify X    // super strong board with a chipset that hates BCLK OC
        MSI B550 Unify X    // amazing OC board
        MSI B450 Mortat Titanium    // it's TITANIUM
Asrock B550 Taichi     // B550 Steel Legend with upgrades?
        Asrock B550 Steel Legend    // great voltage regulation. Solid RAM OC

AM3(+)
        ASUS 990FX Formula-Z
        ASUS 990FX Formula
        ASUS 890FX Extreme
        ASUS 890FX Formula
Gigabyte 990FX UD7
        Gigabyte 990FX UD5    // doesn't go over 1.9ish Vcore IIRC
        Gigabyte 990FX G1 Gaming    // VRM sucks but RAM OC and valids are STRONK
        Gigabyte 970 UD3P    // VRM modification test subject
        Gigabyte 890FX UD7
MSI 990FXA GD80
MSI 890FXA GD70
        Asrock 990FX Extreme 9    // BIOS is mega jank
        Asrock 990FX Fatal1ty Profesional    // seems OK
        Sapphire 990FX Pure Black     // Sapphire fanboying INTENSIFIES
        Biostar 890FXE    // there's an inductor on this board that isn't connected to anything

 FM2(+)     
ASUS A88X Ranger
    
    ASUS A88X-Pro    // modded for extra Vcore and I think broken ATM
Gigabyte A88X UP4
Gigabyte F2A88XN    // better RAM OC than 4 dimmers?
        Gigabyte A85X UP4
MSI A88X-G45 Gaming    // IDK if it's any good but it looks cool IMO

sTR4
ASUS Zenith II Extreme
        Gigabyte TRX40 Master    // repaired a VDDR phase
        MSI TRX40 Creation

Thursday, April 20, 2023

DDR memory latency

DDR memory latency not including memory controller for a single READ command.

READ to an already active row
tCL

READ with memory idle
tRCD >> tCL

READ to a different row in the same bank as an already active row
tRP >> tRCD >> tCL

READ right before or during a refresh
tRFC >> tRCD >> tCL

READ to a different row in the same bank in which a row was just activated
tRAS >> tRP >> tRCD >> tCL

Timings are measured in clock cycles. Therefore DDR4-3200 CL16 has the same memory latency as DDR4-4000 CL20. Same for say DDR5-6000 CL30 and DDR5 7600 CL38. 

To get the ns latency of a timing just divide the timing by the memory clock in GHz(which is half the data rate). For example:

DDR4-3200 CL16
memory clock = 1.6GHz
16/1.6GHz = 10ns

DDR5-7200 CL34
memory clock = 3.6GHz
34/3.6 = 9.44ns

At this point it might seem that as long as the memory timing latency in ns is the same the memory clock has no impact on overall memory latency. However this is not true because the RAM is not directly connected to the CPU cores. It is connected to a memory controller. The memory controller adds it's own latency to the latency of the RAM increasing the overall core to memory latency. How much latency the memory controller adds depends on it's design and clock speed. The clock of the memory controller is directly related to the memory clock. Usually in ratios like 1:1 1:2 and 1:4. Therefore to reduce the latency from the memory controller it is necessary to raise the memory clock.

The memory controller itself is also not usually directly connected to the CPU cores. There is typically an interconnect between the cores and memory controller which adds even more latency. Again how much latency is added depends on the design of the interconnect and it's clock speed. So again the latency can be reduced by raising the clock speed of the interconnect. The interconnect may or may not have a fixed ratio relationship with the memory controller clock potentially with a latency penalty for having to buffer data.

Finally how quickly the cores can process the data arriving from the memory will somewhat influence any software memory latency test.

Basically on intel
more core clock >> less latency
more ring clock >> less latency
more MC clock >> less latency
lower timings >> less latency *

on AMD Ryzen 7000
more core clock >> less latency
more IF clock >> less latency **
more MC clock >> less latency
lower timings >> less latency *

on AMD Ryzen 5000
more core clock >> less latency
more IF clock >> less latency
more MC clock >> less latency
synchronized IF and MC clock >> less latency
lower timings >> less latency *

 

* tREFI is the one timing that you need to increase in order to reduce memory latency. This is because increasing the tREFI reduces the probability that a READ command will have to wait for a refresh to complete as refreshes will be less frequent.

** Ryzen 7000 doesn't seem to have IF to MC synchronization because the IF runs at around 2GHz while the memory controller runs at 2.4-3.2GHz(3.2GHz if you're lucky) so the data beween the MC and IF is always buffered and there's no "synchronization bonus"