From: Khikhlukha Danila <Danila.Khikhlukha@eli-beams.eu>
To: picongpu-users@hzdr.de
Subject: RE: [PIConGPU-Users] Supercell concept
Date: Wed, 19 Oct 2016 18:00:11 +0200
Dear René,

thanks a lot, it is much clearer now.
However, could you recommend some further reading to develop more intuition about this dependency on the stencil and the macro-particle shape?
Unfortunately, at the moment I cannot see where the 32 comes from for the classical Yee scheme + CIC particle shape, for instance.

Thank you in advance,
Danila

________________________________
From: picongpu-users@hzdr.de [picongpu-users@hzdr.de] on behalf of René Widera [r.widera@hzdr.de]
Sent: Wednesday, October 19, 2016 5:25 PM
To: picongpu-users@hzdr.de
Subject: Re: [PIConGPU-Users] Supercell concept

Dear Danila,

the supercell size defines the number of worker threads and the shared-memory cache. 256 is a good value to utilize most GPUs.
The supercell size in each direction needs to be greater than or equal to the number of neighbor cells required by the stencil of the algorithms. This condition is checked at compile time and depends on the selected solvers and the species shape.
The supercell size per direction is independent. The volume x×y×z of the supercell should be a multiple of 32.

best,
René

On 19 October 2016 17:14:28 CEST, Khikhlukha Danila <Danila.Khikhlukha@eli-beams.eu> wrote:

Dear René,

thank you for the prompt reply. Indeed the problem is quite simple. To wrap it up: the number of cells in each direction should satisfy N % (N_gpu * N_supercells) == 0.

Just for educational purposes: what is so special about the number 128 (I guess it is the size of a cache)? Could it be, for instance, 256? Would it be possible to specify the same number of supercells in the X and Z directions?

Thanks a lot,
Danila.
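The "multiple of 32" in René's answer lines up with the NVIDIA warp size: one thread per cell means the supercell volume is rounded up to whole warps of 32 threads, and any remainder leaves idle lanes. A minimal sketch of that arithmetic (illustrative only, not PIConGPU code; `warp_utilization` is a made-up helper):

```python
import math

WARP_SIZE = 32  # threads per warp on NVIDIA GPUs


def warp_utilization(supercell):
    """Fraction of launched threads doing useful work,
    assuming one worker thread per cell of the supercell."""
    volume = supercell[0] * supercell[1] * supercell[2]
    warps = math.ceil(volume / WARP_SIZE)  # hardware schedules whole warps
    return volume / (warps * WARP_SIZE)


print(warp_utilization((8, 8, 4)))  # volume 256 = 8 full warps -> 1.0
print(warp_utilization((2, 8, 2)))  # volume 32 = 1 full warp -> 1.0
print(warp_utilization((5, 5, 5)))  # volume 125 -> 4 warps, 3 idle lanes
```

Any supercell whose volume is a multiple of 32 fills its warps exactly; the default 8x8x4 = 256 cells gives eight full warps.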
________________________________
From: picongpu-users@hzdr.de [picongpu-users@hzdr.de] on behalf of René Widera [r.widera@hzdr.de]
Sent: Wednesday, October 19, 2016 4:59 PM
To: picongpu-users@hzdr.de
Subject: Re: [PIConGPU-Users] Supercell concept

Dear Danila,

the volume per GPU needs to be a multiple of the supercell size.

In your case, 4176 / 8 GPUs = 522, and 522 is not divisible by 8.
4160 cells in the y direction should solve your problem.

Please keep in mind that if you change the supercell size to a value smaller than 128 cells, most simulations run slower.
The default size of 8x8x4 shows the best results in most cases.

best,

René

On 19 October 2016 16:46:28 CEST, Khikhlukha Danila <Danila.Khikhlukha@eli-beams.eu> wrote:

Dear all,

I have some trouble trying to specify a computational grid with a moving window using PIConGPU v0.2.0. We discussed this topic previously; however, I have the same problem again, so I likely misunderstood something the last time.

I am trying to launch the simulation on 4 K80 cards: 8 GPU devices overall. In the memory.param file I have specified the supercell layout as (2, 8, 2). I want to have one GPU in the transversal direction and 8 in the longitudinal one, so in the cfg file I specified:

TBG_gpu_x=1
TBG_gpu_y=8
TBG_gpu_z=1

Then I would like my real computational domain to have 256 x 3712 x 256 cells. Since the moving window reduces the real domain by 1 GPU in the y direction, I specified my grid as 256 x 4176 x 256 (4176 = 9/8 * 3712):

TBG_gridSize="-g 256 4176 256"

However, trying to submit such a cfg file, I receive an assertion failure:

void picongpu::MySimulation::checkGridConfiguration(PMacc::DataSpace<DIM>, PMacc::GridLayout<DIM>) [with unsigned int DIM = 3u]: Assertion `gridSizeLocal[i] % MappingDesc::SuperCellSize::toRT()[i] == 0' failed.

However, 4176 % 8 == 0 and 256 % 2 == 0.

Could you please guide me on how to solve this issue? It looks like I misunderstand the concept of the supercell.
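The assertion above can be restated as: in every direction, the *per-GPU* grid size (not the global one) must be divisible by the supercell size. A small sketch of that check, using the numbers from this thread (`check_grid` is an illustrative helper, not part of PIConGPU):

```python
def check_grid(grid_size, gpus, supercell):
    """Return True if each GPU's local grid size is a whole multiple
    of the supercell size in every direction."""
    for n_cells, n_gpus, sc in zip(grid_size, gpus, supercell):
        if n_cells % n_gpus != 0:
            return False          # cells must split evenly across GPUs
        local = n_cells // n_gpus  # cells per GPU in this direction
        if local % sc != 0:
            return False          # local grid not a multiple of supercell
    return True


# Original setup: 4176 / 8 GPUs = 522 cells per GPU, 522 % 8 != 0 -> fails,
# even though the *global* 4176 % 8 == 0.
print(check_grid((256, 4176, 256), (1, 8, 1), (2, 8, 2)))  # False
# René's fix: 4160 / 8 GPUs = 520 cells per GPU, 520 % 8 == 0 -> passes
print(check_grid((256, 4160, 256), (1, 8, 1), (2, 8, 2)))  # True
```

This is exactly why 4176 % 8 == 0 was not enough: the divisibility is checked on gridSizeLocal, i.e. after the global grid has been split across the 8 GPUs in y.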
Thank you in advance,
Danila.

--
This message was sent from my Android mobile phone with K-9 Mail.