
Lecture 2.

Kernel-based Parallel Programming - Control Divergence

Objective

To understand the implications of warp scheduling on control flow:
- Nature of control flow
- Warp partitioning
- Control divergence

Warps as Scheduling Units


Block 1 Warps

t0 t1 t2 t31

2 Warps
t0Block
t1 t2 t31

Block 1 Warps

t0 t1 t2 t31

Each block is divided into 32-thread warps
- An implementation decision, not part of the CUDA programming model
- Warps are the scheduling units in an SM
- Threads in a warp execute in SIMD fashion
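The warp a thread belongs to is not exposed directly by the programming model, but for a 1-D block it follows from integer division by the 32-thread warp size. A minimal sketch, assuming a 1-D block; showWarpMapping, warpId, and laneId are illustrative names, not CUDA built-ins:

#include <cstdio>

// Each warp-leader thread reports where its warp starts within the block.
__global__ void showWarpMapping(void)
{
    int warpId = threadIdx.x / 32;   // which warp of the block this thread is in
    int laneId = threadIdx.x % 32;   // position of the thread within that warp
    if (laneId == 0)
        printf("block %d: warp %d starts at thread %d\n",
               blockIdx.x, warpId, threadIdx.x);
}

int main(void)
{
    showWarpMapping<<<1, 128>>>();   // one block of 128 threads = 4 warps
    cudaDeviceSynchronize();         // wait so device-side printf output is flushed
    return 0;
}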

Back to Von-Neumann

Every instruction needs to be fetched from memory, decoded, then executed.
Instructions come in three flavors: Operate, Data Transfer, and Program Control Flow.
An example instruction cycle is the following:
Fetch | Decode | Execute | Memory

Control Flow Operations

Example of a control flow instruction:

BRp #-4
If the condition is positive, jump back four instructions.

How thread blocks are partitioned

Thread blocks are partitioned into warps
- Partitioning is always the same
- Thread IDs within a warp are consecutive and increasing
- Warp 0 starts with Thread ID 0
- Thus you can use this knowledge in control flow
- However, the exact size of warps may change from generation to generation (covered next)

DO NOT rely on any ordering within or between warps
- If there are any dependencies between threads, you must __syncthreads() to get correct results (more later); see the sketch below.
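As an illustration of the last two points, the sketch below (not from the lecture; the kernel and its names are hypothetical) reverses one tile of data through shared memory. Threads in different warps may run in any order, so without the __syncthreads() barrier a thread could read a slot that a thread in another warp has not yet written.

#define BLOCK_SIZE 256   // launch with exactly BLOCK_SIZE threads per block

// Reverses each block-sized tile of d_in into d_out via shared memory.
__global__ void reverseTile(int *d_out, const int *d_in)
{
    __shared__ int tile[BLOCK_SIZE];

    int t = threadIdx.x;
    tile[t] = d_in[blockIdx.x * blockDim.x + t];

    // Wait until the whole tile is written before any thread reads
    // an element written by a thread in another warp.
    __syncthreads();

    d_out[blockIdx.x * blockDim.x + t] = tile[BLOCK_SIZE - 1 - t];
}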

Control Flow Instructions

Main performance concern with branching is divergence
- Threads within a single warp take different paths
- Different execution paths are serialized in current GPUs

The control paths taken by the threads in a warp are traversed one at a time until there are no more.

Control Divergence Examples

Divergence can arise only when the branch condition is a function of thread indices.

Example with divergence:
if (threadIdx.x > 2) { }
- This creates two different control paths for threads in a block
- Branch granularity < warp size; threads 0, 1, and 2 follow a different path than the rest of the threads in the first warp

Example without divergence:
if (blockIdx.x > 2) { }
- Branch granularity is a multiple of block size; all threads in any given warp follow the same path
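Written out as complete (if trivial) kernels, the contrast looks as follows; the kernel names and the branch bodies are placeholders, not part of the lecture:

// Divergent: within the first warp of each block, threads 0-2 take one path
// and threads 3-31 take the other, so that warp runs both paths serially.
__global__ void divergentKernel(float *data)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (threadIdx.x > 2) {
        data[i] *= 2.0f;   // threads 3 .. blockDim.x-1
    } else {
        data[i] = 0.0f;    // threads 0, 1, 2
    }
}

// Non-divergent: the condition depends only on blockIdx.x, so every thread
// in a given warp (indeed in a given block) evaluates it the same way.
__global__ void uniformKernel(float *data)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (blockIdx.x > 2) {
        data[i] *= 2.0f;
    }
}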

Example: Vector Addition Kernel

Device Code

// Compute vector sum C = A + B
// Each thread performs one pair-wise addition
__global__
void vecAddKernel(float* A, float* B, float* C, int n)
{
    int i = threadIdx.x + blockDim.x * blockIdx.x;
    if (i < n) C[i] = A[i] + B[i];
}
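The analysis below assumes a 1-D launch with 256 threads per block, so 1,000 elements need 4 blocks covering thread indices 0-1023. A possible host-side launch under that assumption (allocation and copies omitted):

Host Code (sketch)

int n = 1000;
int threadsPerBlock = 256;                                  // assumed block size
int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;   // = 4
vecAddKernel<<<blocks, threadsPerBlock>>>(d_A, d_B, d_C, n);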

Analysis for vector size of 1,000 elements

All threads in Blocks 0, 1, and 2 are within the valid range
- i values from 0 to 767
Block 3 will have control divergence
- 1st group: i values from 768 to 999
- 2nd group: i values from 1000 to 1023
The performance penalty to blocks with no divergence is very small
- Only the last block will have divergence
- The overall performance penalty is small as long as there is a large number of elements.
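Tracking the boundary at warp granularity (still assuming 256-thread blocks) shows that only the warp straddling i = 1000 mixes valid and out-of-range indices; the small host-side check below is purely illustrative:

#include <cstdio>

int main(void)
{
    const int n = 1000, threadsPerBlock = 256, warpSize = 32;
    const int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;   // 4

    // A warp diverges on "if (i < n)" only if it contains both valid and
    // out-of-range global indices, i.e. it straddles the boundary at n.
    for (int w = 0; w < blocks * threadsPerBlock / warpSize; ++w) {
        int first = w * warpSize, last = first + warpSize - 1;
        if (first < n && last >= n)
            printf("warp %d (i = %d..%d) diverges\n", w, first, last);
    }
    return 0;
}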

TO LEARN MORE, READ SECTION 4.3
