Running a Datacenter Performance Optimization Campaign by Nadav Rotem

Optimising Code - Computerphile

SIMD and vectorization using AVX intrinsic functions (Tutorial)

Zahraj si se mnou 2! #shorts

PROČ NEMÁM RÁD 69

1🥺🎉 #thankyou

The Art of SIMD Programming by Sergey Slotin

Performance Summit

zhlédnutí 8 937

Přidat do
- Můj playlist
- Přehrát později
Sdílet

Sdílet

Vložit

Velikost videa:

Zobrazit ovladače přehrávání

Automatické přehrávání

Přehrát

čas přidán 8. 09. 2022
Modern hardware is highly parallel, but not only in terms of multiprocessing. There are many other forms of parallelism that, if used correctly, can greatly boost program efficiency - and without requiring more CPU cores. One such type of parallelism actively adopted by CPUs is "Single Instruction, Multiple Data" (SIMD): a class of instructions that can perform the same operation on a block of 16, 32, or 64 bytes of data in one go, yielding a proportional speedup over scalar code.
While SIMD shares many similarities with classic multiprocessor computing, it is quite different and often requires creative use of the instruction set. In this talk, we will give a general introduction to the technology (focusing on x86/AVX2), derive and implement several state-of-the-art SIMD algorithms, and discuss their use in impactful open-source projects.
skillsmatter.com/skillscasts/...
Věda a technologie

Komentáře • 5

@yuangchen905 Před rokem ⁺³
great video. Thank very much for your lightening example and insightful explanation!
@Roxas99Yami Před rokem ⁺¹
Thanks very appreciated. Especially the examples in C. Is this directky compatible in Cython ?
@Roxas99Yami Před rokem
The intrinsics i mean
@martingeorgiev999 Před rokem ⁺³
I don't understand why these architecture specific instructions are not recognized directly by gcc on O3.
@bouazzase4202 Před rokem ⁺⁹
they are, when you give the -march= argument, otherwise the compiler doesn't know which instruction sets are allowed and will fall back to a default (usually x86-64 without avx)

Další v pořadí

Automatické přehrávání

Running a Datacenter Performance Optimization Campaign by Nadav Rotem

Running a Datacenter Performance Optimization Campaign by Nadav Rotem

Optimising Code - Computerphile

Optimising Code - Computerphile

SIMD and vectorization using AVX intrinsic functions (Tutorial)

SIMD and vectorization using AVX intrinsic functions (Tutorial)

Zahraj si se mnou 2! #shorts

Zahraj si se mnou 2! #shorts

PROČ NEMÁM RÁD 69

PROČ NEMÁM RÁD 69

The Worlds Most Powerfull Batteries !

The Worlds Most Powerfull Batteries !

Where Have All the Cycles Gone? by Sean Parent

Where Have All the Cycles Gone? by Sean Parent

A unifying force (2024): an Abdus Salam documentary

A unifying force (2024): an Abdus Salam documentary

Extreme SIMD: Optimized Collision Detection in Titanfall

Extreme SIMD: Optimized Collision Detection in Titanfall

Performance: SIMD, Vectorization and Performance Tuning | James Reinders, former Intel Director

Performance: SIMD, Vectorization and Performance Tuning | James Reinders, former Intel Director

Branchless Programming in C++ - Fedor Pikus - CppCon 2021

Branchless Programming in C++ - Fedor Pikus - CppCon 2021

Pushing Java to the Limits: Processing a Billion Rows in under 2 Seconds by ROY VAN RIJN

Pushing Java to the Limits: Processing a Billion Rows in under 2 Seconds by ROY VAN RIJN

CPU Cache Effects - Sergey Slotin - Meeting C++ 2022

CPU Cache Effects - Sergey Slotin - Meeting C++ 2022

unlock the lowest levels of coding

unlock the lowest levels of coding

Faster than Rust and C++: the PERFECT hash table

Faster than Rust and C++: the PERFECT hash table

Skoro Perfektný - OnePlus Open Recenzia

Skoro Perfektný - OnePlus Open Recenzia

Power up all cell phones.

Power up all cell phones.

Showing Scammers Their Own CCTV Cameras On My Computer!

Showing Scammers Their Own CCTV Cameras On My Computer!

The Coolest PSU | ROG Thor 1000w Platinum II Eva Edition ASMR Unboxing

The Coolest PSU | ROG Thor 1000w Platinum II Eva Edition ASMR Unboxing

Google's secret algorithm exposed via leak to GitHub…

Google's secret algorithm exposed via leak to GitHub…

#smartphone #applecase #phonecase #iphone11case #iphonecase #backcase #iphone13promaxcase

#smartphone #applecase #phonecase #iphone11case #iphonecase #backcase #iphone13promaxcase

NEW iPad Air 2024 🤔 "purple" iPad Air M2 unboxing

NEW iPad Air 2024 🤔 "purple" iPad Air M2 unboxing

Разоблачение ручное зарядное устройство

Разоблачение ручное зарядное устройство