Encryption Projects as SU group

Lak7

Registered
Messages
35
Compiled to run on Compute Capability 8.0, 8.6, 8.9, 9.0.
When launched, it checks the card's compute capability and uses the matching code. That's why the size of the build changes: the fewer card architectures included, the smaller the app (in megabytes).
If I included all cards, the app would be around 14MB.
The only real difference between the 2 is that v12_3_VS22 has the number of registers limited to 167, while v12_3_64 is set to Max (255).
The other difference is the SM architecture change from 3xxx to 4xxx. The same happened from 1xxx to 2xxx.
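
To illustrate the launch-time check described above, here is a minimal sketch in Python. It is not CudaBISS's actual code. It models the general CUDA rule that a binary built for one compute capability also runs on devices with the same major version and an equal or higher minor version, and it ignores any PTX fallback. The names `BUILD_TARGETS` and `pick_target` are hypothetical.

```python
# Compute capabilities this build was compiled for, as listed in the post.
BUILD_TARGETS = [(8, 0), (8, 6), (8, 9), (9, 0)]

def pick_target(device_cc, targets=BUILD_TARGETS):
    """Pick the compiled code to run on a device, or None if nothing fits.

    device_cc is a (major, minor) tuple, e.g. (8, 6) for an RTX 3060 Ti
    or (8, 9) for an RTX 4090. A target is usable when its major version
    matches the device and its minor version is not newer than the device's.
    """
    major, minor = device_cc
    usable = [t for t in targets if t[0] == major and t[1] <= minor]
    # Prefer the closest (highest) usable target.
    return max(usable) if usable else None

if __name__ == "__main__":
    for cc in [(8, 6), (8, 9), (7, 5)]:
        print(cc, "->", pick_target(cc))
```

Under this rule a 2xxx card (compute capability 7.5) finds no usable code in a build compiled only for 8.0, 8.6, 8.9 and 9.0, which is why leaving architectures out shrinks the app but also narrows the cards it supports.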
 

Lak7

Registered
Messages
35
Test using 4xxx cards ....

SHA256: 785AB8A8E810E7433D5203DB1BCDC8F5C92C737232A4F345A6325174B1076E00
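
A posted SHA256 like the one above can be checked against the downloaded file before running it. The following is a minimal sketch using Python's standard `hashlib`. The function names are hypothetical, and the file name in the usage note is a placeholder because the post does not give one.

```python
import hashlib

# Hash value copied from the post above.
EXPECTED = "785AB8A8E810E7433D5203DB1BCDC8F5C92C737232A4F345A6325174B1076E00"

def sha256_of_file(path, chunk_size=1 << 20):
    """Return the upper-case hex SHA-256 digest of the file at path."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large downloads do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest().upper()

def matches_posted_hash(path, expected=EXPECTED):
    """True when the file's digest equals the posted one (case-insensitive)."""
    return sha256_of_file(path) == expected.strip().upper()
```

Usage would look like `matches_posted_hash("download.zip")`, where `download.zip` stands for whatever the downloaded file is actually called. A result of `False` means the file differs from the one the hash was computed on.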
 

moonbase

VIP
Donating Member
Messages
543
The only real difference between the 2 is that v12_3_VS22 has the number of registers limited to 167, while v12_3_64 is set to Max (255).
The other difference is the SM architecture change from 3xxx to 4xxx. The same happened from 1xxx to 2xxx.

The SM architecture change from 3xxx to 4xxx cards did not affect CudaBISS speed on an RTX 4090 card until v12_3_VS22.
With each release up to v12_3_64 the RTX 4090 ran faster, peaking with v12_3_64.
v12_3_VS22 was the first release to run slower than the version before it.

Could the reduced number of registers in v12_3_VS22 be causing it to run slower with an RTX 4090 compared to v12_3_64?
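
As a rough way to reason about this question, the per-thread register cap limits how many warps can be resident on one SM at a time. The sketch below is an estimate, not a measurement of CudaBISS. It assumes 65,536 registers and at most 48 resident warps per SM (the figures for compute capability 8.6 and 8.9), assumes each kernel thread really uses the full cap, assumes registers per thread are rounded up to a multiple of 8, and ignores shared memory and block-size limits. The name `resident_warps` is hypothetical.

```python
REGISTERS_PER_SM = 65536  # 32-bit registers per SM on compute capability 8.x
MAX_WARPS_PER_SM = 48     # resident-warp limit on 8.6 and 8.9
WARP_SIZE = 32            # threads per warp
ALLOC_UNIT = 8            # assumption: per-thread registers round up to a multiple of 8

def resident_warps(regs_per_thread):
    """Estimate how many warps fit on one SM when limited only by registers."""
    # Ceiling division to round up to the allocation unit.
    rounded = -(-regs_per_thread // ALLOC_UNIT) * ALLOC_UNIT
    by_registers = REGISTERS_PER_SM // (rounded * WARP_SIZE)
    return min(by_registers, MAX_WARPS_PER_SM)

if __name__ == "__main__":
    for cap in (167, 255):
        warps = resident_warps(cap)
        print(f"cap {cap}: {warps} warps, {warps / MAX_WARPS_PER_SM:.0%} occupancy")
```

Under these assumptions the 167-register cap allows 12 resident warps per SM against 8 for the 255 cap, so the lower cap gives more parallelism on paper. A lower cap can also force the compiler to spill values to slower local memory, so it can still end up slower in practice. Only a timing run on the actual card settles which effect wins.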
 

moonbase

VIP
Donating Member
Messages
543
Test using 4xxx cards ....

Thank you for sharing the new version.
Is this version specifically for 4xxx series cards or will it run on 3xxx series cards and earlier series?
 

moonbase

VIP
Donating Member
Messages
543
Test using 4xxx cards ....

A quick test shows that with an RTX 4090 card the new CudaBISS version of v12_3_VS22 (4xxx) runs faster than v12_3_64 with the same card.
For an RTX 4090, v12_3_VS22 (4xxx) is the fastest version to date.
 

Lak7

Registered
Messages
35
A quick test shows that with an RTX 4090 card the new CudaBISS version of v12_3_VS22 (4xxx) runs faster than v12_3_64 with the same card.
For an RTX 4090, v12_3_VS22 (4xxx) is the fastest version to date.
About 10% or so?
 

spot

Registered
Messages
17
May I ask which CudaBISS build you use for a 3060 Ti card, and where I could find it? Thank you.
 

moonbase

VIP
Donating Member
Messages
543
May I ask which CudaBISS build you use for a 3060 Ti card, and where I could find it? Thank you.


Reply 256 of this topic contains a link to the 3xxx card build.
Will you be reporting back to share your results?
 

spot

Registered
Messages
17
Reply 256 of this topic contains a link to the 3xxx card build.
Will you be reporting back to share your results?
I saw that before, but the link doesn't work for me. Thanks, I found a file to use elsewhere. About 4.8B running 2 instances.
 

spot

Registered
Messages
17
I'm using an Intel i7-12700K / MSI Z690 PRO-A / 32 GB RAM / MSI 3060 Ti / Corsair 850 power supply.
109999999999
C00000000000
474065926C7C1938C608F122FF3AE37E
4740659CAED1E977D78AB1AE1BCF68C4
4740659AF73EB8CA49C814B3B4599447
1
1
 

moonbase

VIP
Donating Member
Messages
543
I'm using an Intel i7-12700K / MSI Z690 PRO-A / 32 GB RAM / MSI 3060 Ti / Corsair 850 power supply.


That PC spec is probably about as fast as it gets for CudaBISS. The GPU can run at full PCIe 4.0 speed, as the board should provide PCIe 5.0 on PCIe x16 slot 1.
 