<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
      <title>Tagged with windows - NVIDIA Developer Forums</title>
      <link>http://forums.developer.nvidia.com/devforum/discussions/tagged/windows/feed.rss</link>
      <pubDate>Wed, 16 May 12 17:37:31 -0400</pubDate>
         <description>Tagged with windows - NVIDIA Developer Forums</description>
   <language>en-CA</language>
   <atom:link href="/devforum/discussions/taggedwindows/feed.rss" rel="self" type="application/rss+xml" />
   <item>
      <title>Tesla vs GTX560M this is wierd!</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8311/tesla-vs-gtx560m-this-is-wierd</link>
      <pubDate>Wed, 16 May 2012 16:32:03 -0400</pubDate>
      <dc:creator>Bleiner</dc:creator>
      <guid isPermaLink="false">8311@/devforum/discussions</guid>
      <description><![CDATA[Hello Everyone,<br /><br />So I have been working on porting over a lot of the more computationally intensive portions of my code from matlab to .ptx gpu code.  I have been doing the development on my laptop where I have made HUGE increases in speed.  Achieving at least a 10X speed up in a lot of areas.  Now all my work is double precision, and involves millions of objects.  My laptop with a gaming GPU (that by design has hampered double precision performance, and fewer processing cores) can complete a task in about 46 seconds.  <br /><br />I have a desktop with a Tesla C2075, that typically outperforms my laptop by a factor of two.  When I bring this code over to the Tesla machine it is running anywhere from 42-48 seconds to complete the same task.  Does anyone have any idea why this would be?  <br /><br />The only thing that comes to mind on this is that I upgraded my laptop to the 301.40 driver version to use Nsight visual studio, while the Tesla machine is still on 301.32 (or someething like that.  When I attempted to upgrade the tesla machine it appears that the 301.40 drivers have been removed from the website.  When I upgraded to the Beta Cuda 5 version drivers 301.53 (i think) Matlab no longer recognizes that there is a GPU attached to the system.  <br /><br />Could the issues be the driver?  Did it improve that much from 301.27 to 301.4?  Does it have anything to do with the GPU on the laptop being compute level 2.1 while tesla is 2.0?  Is there a memory manager issue that 2.1 does a lot better?  <br /><br />The strange thing is that TESLA USED to perform at twice the speed of the laptop (and that includes all the overhead in matlab that is taking place on the CPU.  So Tesla must have been significantly more than 2x faster.  <br /><br />Any thoughts?<br /><br />Thanks<br />Ben<br /><br />As an after thought here are the specs of the machines in question.. <br />Tesla Work station<br />Dell Precision T5500<br />Xeon E5620 2.4ghz<br />12gb ram<br />Telsa C2075 with 6gb ram driver 301.32<br /><br />Laptop<br />Asus G74S<br />Core I7 2670QM 2.2ghz<br />12gb Ram<br />GTX 560M 3gb ram Driver 301.40<br />]]></description>
   </item>
      <item>
      <title>DisplayLink BVU195 USB adapter</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8301/displaylink-bvu195-usb-adapter</link>
      <pubDate>Wed, 16 May 2012 12:54:21 -0400</pubDate>
      <dc:creator>tnoonkes</dc:creator>
      <guid isPermaLink="false">8301@/devforum/discussions</guid>
      <description><![CDATA[I have a Lenovo T410 with the NVIDIA NVS 3100M. I recently bought a USB to DVI adapter so that i could use a 3rd monitor with my laptop. *(Laptop monitor is primary with 2 other samsung monitors attached, one via DVI input from laptop dock, second from the USB adapter.) I have updated my video drivers to the latest drivers but when i install the DisplayLink software for my USB to DVI, my computer crashes every time when i reboot windows. Please let me know if there is any fix for that adapter and NVS 3100M. Thanks.]]></description>
   </item>
      <item>
      <title>downloading of the installer doesn&#039;t complete</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8266/downloading-of-the-installer-doesnt-complete</link>
      <pubDate>Wed, 16 May 2012 05:59:53 -0400</pubDate>
      <dc:creator>mathiapeter</dc:creator>
      <guid isPermaLink="false">8266@/devforum/discussions</guid>
      <description><![CDATA[Hi, I had Nsight Visual Studio RC2 installed and everything worked fine, but now whet I try do download the final version 64bit, it shows that the installer has about 950MB but the downloading ends always at 51% so I can't run the installer. What would be the problem? I have tried 3 internet browsers but still the same.]]></description>
   </item>
      <item>
      <title>Cuda, PTX and Debugging symbols</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6511/cuda-ptx-and-debugging-symbols</link>
      <pubDate>Wed, 28 Mar 2012 13:08:21 -0400</pubDate>
      <dc:creator>Bleiner</dc:creator>
      <guid isPermaLink="false">6511@/devforum/discussions</guid>
      <description><![CDATA[Hello everyone,<br /><br />I have a question about .ptx files and debugging in Visual studio 10 professional.  I am attempting to write some ptx code to integrate into Matlab.  However I am ONLY writing the  __global__ functions and I have not written a main function or any host code.  The functions are fairly simple, but what I want to do is be able to debug my code when matlab is running it.  I am compiling my code using nvcc -ptx 'functionname.cu'. and when I try to use nvcc -G -ptx 'functionname.cu' to get debugging information nothing else is returned except the .ptx file.  I should mention that I am relatively new to visual studio and that is why I am compiling using the command line.  <br /><br />I believe that if I had the symbols generated I could attach the process to Matlab and debug it.  Since .ptx is supposed to be just in time compiled code does it not allow me to have debugging symbols?  <br /><br />Any help would be greatly appreciated<br />Ben]]></description>
   </item>
      <item>
      <title>Breakpoint don&#039;t be hit using Nsight Visual Studio</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8216/breakpoint-dont-be-hit-using-nsight-visual-studio</link>
      <pubDate>Tue, 15 May 2012 08:49:16 -0400</pubDate>
      <dc:creator>froure</dc:creator>
      <guid isPermaLink="false">8216@/devforum/discussions</guid>
      <description><![CDATA[Hi!<br /><br />I'm trying to debug with Nsight. I put some breakpoints but debugger don't stop the execution there. During the debugging, this breakpoints appers with a warning icon, and when I put the mouse over, it says: "The breakpoint will not currently bye hit. CUDA: no source correspondence for breakpoint".<br /><br />I try to find some solution inside the other forums, but I can't.<br /><br />Can you help me please?<br /><br />Thanks!! :D]]></description>
   </item>
      <item>
      <title>Linker Issue in building a CUDA application in Visual Studio 2010</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/4766/linker-issue-in-building-a-cuda-application-in-visual-studio-2010</link>
      <pubDate>Thu, 16 Feb 2012 03:56:49 -0500</pubDate>
      <dc:creator>gladman</dc:creator>
      <guid isPermaLink="false">4766@/devforum/discussions</guid>
      <description><![CDATA[Reposted from old forum.<br /><br />I am trying to build an application with Visual Studio 2010 and Nsight 2.1 hosted on Windows 7 (x64) using the 4.0 and/or the 4.1 build customisation rules.  <br /><br />I am using object file output format with the default filename for the output object files - $(IntDir)%(FileName)%(Extension).obj, which correctly generates all the object files in the $(IntDir) directory.   But after all the files have compiled correctly, the linker build step then fails with the message:<br /><br />LINK : fatal error LNK1181: cannot open input file '..\.obj'<br /><br />It appears that the file listing inputs for the linker is not being generated.<br /><br />If I perform the linker step manually, the application builds correctly and runs without problems. So it seems that there is an issue with the way the build customisations are producing the input file lists for the linker.   <br /><br />I would appreciate any advice that people can offer on what might be going wrong. <br />]]></description>
   </item>
      <item>
      <title>Kepler Quadro?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8236/kepler-quadro</link>
      <pubDate>Tue, 15 May 2012 15:43:41 -0400</pubDate>
      <dc:creator>Bleiner</dc:creator>
      <guid isPermaLink="false">8236@/devforum/discussions</guid>
      <description><![CDATA[Hello Everyone,<br /><br />Not to be a starting rumors, but has anyone heard anything about when the new Kepler Quadro cards will be coming out?   I am responsible for selecting new laptops for the design department at my company.  The 5010M looks to be a great card don't get me wrong, but with my code starting to rely more on CUDA, I want to get the most powerful GPU for double precision work I can find.  I would hate to drop a LARGE sum of money on a laptop only to have the GPU be obsolete a month later, especially when the GPU is a major component of the cost.  <br /><br />Any rumors?  Any employees around who can deny knowing anything while winking at the same time?<br /><br />Thanks<br />Ben]]></description>
   </item>
      <item>
      <title>Help adding external force to particles</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8111/help-adding-external-force-to-particles</link>
      <pubDate>Sat, 12 May 2012 10:29:45 -0400</pubDate>
      <dc:creator>dartwing17</dc:creator>
      <guid isPermaLink="false">8111@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />I have run a wind simulation in Flow-3D and have exported a set of wind-vectors, which describe<br />a position and a velocity. I want to use these wind-vectors in my particle system, so that when a particle is inside a wind-vector position (lets say +- 0.5), the velocity of the wind-vector should be added to the particle.<br /><br />I have implemented a way to do this, but it is very inefficient. I was just wondering if anyone had any suggestion on how to make the algorithm more efficient.<br /><br />This is my implementation (sorry for the ugly code):<br /><br />glm::vec3 checkWindPosition(glm::vec3 particlepos)<br />{<br />	for(int i = 0; i &lt; flowWindGrid.size(); i++)<br />	     if(particlepos.x &lt; flowWindGrid[i].position.x+0.5f &amp;&amp; particlepos.x &gt; flowWindGrid[i].position.x-0.5f<br />			&amp;&amp; particlepos.y &lt; flowWindGrid[i].position.y+0.5f &amp;&amp; particlepos.y &gt; flowWindGrid[i].position.y-0.5f<br />			&amp;&amp; particlepos.z &lt; flowWindGrid[i].position.z+0.5f &amp;&amp; particlepos.z &gt; flowWindGrid[i].position.z-0.5f)<br />			return flowWindGrid[i].velocity;<br />	return glm::vec3(99,99,99);<br />}<br /><br />void update()<br />{<br /> physx::PxParticleFluidReadData* rd = pf-&gt;lockParticleFluidReadData();<br /> if (rd-&gt;validParticleRange &gt; 0)<br /> {<br />	 physx::PxStrideIterator particleFlags(rd-&gt;flagsBuffer);<br />	// iterate over valid particle bitmap<br />	for (physx::PxU32 w = 0; w &lt;= (rd-&gt;validParticleRange-1) &gt;&gt; 5; w++)<br />	{<br />	   for (physx::PxU32 b = rd-&gt;validParticleBitmap[w]; b; b &amp;= b-1)<br />	   {<br />	      physx::PxU32 index = (w &lt;&lt; 5 | physx::lowestSetBit(b));<br />	      const physx::PxVec3&amp; position = rd-&gt;positionBuffer[index];<br /><br />	      if (particleFlags[index] &amp; physx::PxParticleFlag::eVALID)<br />	      {<br />                glm::vec3 newvelocity = checkWindPosition(glm::vec3(position.x,position.y,position.z));<br />		if(newvelocity != glm::vec3(99,99,99))<br />		{<br />		   windIndexBuffer.push_back(index);<br />		   windForce.push_back(physx::PxVec3(newvelocity.x,newvelocity.y,newvelocity.z));<br />		}<br />	      }<br />	   }<br />	}<br /> }<br /><br /> rd-&gt;unlock();<br /><br /> physx::PxStrideIterator forceBuffer(&amp;windForce[0]);<br /> physx::PxStrideIterator indexData(&amp;windIndexBuffer[0]);<br /> pf-&gt;addForces(windForce.size(), indexData, forceBuffer, physx::PxForceMode::eFORCE);<br />}]]></description>
   </item>
      <item>
      <title>ArrayFire + Nsight???</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7996/arrayfire-nsight</link>
      <pubDate>Wed, 09 May 2012 21:08:48 -0400</pubDate>
      <dc:creator>sizheng</dc:creator>
      <guid isPermaLink="false">7996@/devforum/discussions</guid>
      <description><![CDATA[i'm trying ArrayFire, but it seems that i cannot debug arrayfire code by nsight~~~<br /><br />anybody knows how to?]]></description>
   </item>
      <item>
      <title>Parallel Nsight 2.2 RC2 - No Source Available</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7886/parallel-nsight-2-2-rc2-no-source-available</link>
      <pubDate>Fri, 04 May 2012 16:35:28 -0400</pubDate>
      <dc:creator>wdrozd</dc:creator>
      <guid isPermaLink="false">7886@/devforum/discussions</guid>
      <description><![CDATA[When selecting CUDA Debugging with memory checker enabled I get a kernel crash with a window in  Nsight that says "No Source Available". When I click on the link "Browse to Find Source", a message says "The source code cannot be displayed".<br /><br />My application compiles fine, so clearly it can find the source (for both the C code and the Cuda code)<br /><br />Also I have no problem stopping at a breakpoint set in my Kernel prior to the crash (grid launch failure)<br /><br />The call-stack says "No active Cuda Kernels".<br /><br />Can you please me determine how to set Nsight to detect the source?<br /><br />Thanks.]]></description>
   </item>
      <item>
      <title>Can I incorporate volume data into PhyX?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7981/can-i-incorporate-volume-data-into-phyx</link>
      <pubDate>Wed, 09 May 2012 11:53:28 -0400</pubDate>
      <dc:creator>chloe5608</dc:creator>
      <guid isPermaLink="false">7981@/devforum/discussions</guid>
      <description><![CDATA[Hi, I'm new to PhysX and I'm trying to do some fluid simulation with volumetric data. I have my old SPH code written in C++ and the particles can collide with volume data. But I'm thinking of apply phsX for my simulation since I want to use more particles and my old implementation of SPH is not quite robust.<br /><br />Can I extract the property of particle(e.g. position, velocity, acceleration) and do the collision myself then send the data back? All the volume data is available on both CPU and GPU(texture).]]></description>
   </item>
      <item>
      <title>UI Composer studio</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8171/ui-composer-studio</link>
      <pubDate>Mon, 14 May 2012 07:52:59 -0400</pubDate>
      <dc:creator>gandlaf</dc:creator>
      <guid isPermaLink="false">8171@/devforum/discussions</guid>
      <description><![CDATA[I started with the T20 Colibri board my development. As a part of the Demos software I found the UI Composer Demo. Now I am questioning from where I can get the UI Composer studio.<br />Thanks]]></description>
   </item>
      <item>
      <title>I want to display 3d pictures through 3D TV which is connected to my pc by HDMI port.</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8161/i-want-to-display-3d-pictures-through-3d-tv-which-is-connected-to-my-pc-by-hdmi-port-</link>
      <pubDate>Mon, 14 May 2012 06:15:17 -0400</pubDate>
      <dc:creator>zhaiwenguo</dc:creator>
      <guid isPermaLink="false">8161@/devforum/discussions</guid>
      <description><![CDATA[I want to write a program to display 3D pictures(Or play 3D movies) through 3D TV which is connected to my PC by HDMI port. Just like NVIDIA 3D vision. Which APIs can I use? Thank you in advance. Are there some examples?<br />(GeForce GT 555M , Visual Studio 2008)]]></description>
   </item>
      <item>
      <title>Double Double or Arbitrary Precision</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8141/double-double-or-arbitrary-precision</link>
      <pubDate>Sun, 13 May 2012 21:43:53 -0400</pubDate>
      <dc:creator>rdunn</dc:creator>
      <guid isPermaLink="false">8141@/devforum/discussions</guid>
      <description><![CDATA[I need to compare a high precision (i.e at least 2x that of double) with results from my CUDA kernels in double and single precision to establish how accurate they are compared to other implementations. <br /><br />I have struggled and failed at getting GMP/MPIR to build on a windows platform with visual studio 2010. I've tried building GMP/MPIR with mingw to no avail too. <br /><br />Does nVidia have a double-double library?, I saw a forum post to it (<a href="http://forums.nvidia.com/index.php?showtopic=218452" target="_blank" rel="nofollow">http://forums.nvidia.com/index.php?showtopic=218452</a>), but are unable to find the actual file. <br />]]></description>
   </item>
      <item>
      <title>Trouble with processing image in rows</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8121/trouble-with-processing-image-in-rows</link>
      <pubDate>Sun, 13 May 2012 05:29:07 -0400</pubDate>
      <dc:creator>laz007</dc:creator>
      <guid isPermaLink="false">8121@/devforum/discussions</guid>
      <description><![CDATA[Hello!<br />I'm making an image filter that is processing the image in rows.<br />Two weeks I'm trying to figure out why it's not working when executed in parallel.<br />I use only threads in the Y dimension. Is that a problem?<br /><br /><br /><br />Here is part of the code:<br />BLOCKDIM_Y=16;<br />....<br />dim3 threads(1, BLOCKDIM_Y);<br />dim3 grid(1,  iDivUp(h, BLOCKDIM_Y));<br /><br />my_CUDA_filter&lt;&lt;&lt; grid, threads&gt;&gt;&gt;(sumR, sumG, sumB, mask,h,w, inD, outD, test);<br />...<br /><br />__global__ void my_CUDA_filter_simple222(int* sumR, int* sumG, int* sumB, int mask,int h,int w, u_int8_t *in, u_int8_t *out, int* test){<br />...<br />int iy = blockDim.y * blockIdx.y + threadIdx.y;<br />int ix=0;<br /><br />	if (iy&gt;=m &amp;&amp; iy&lt;(h-m)) {<br /><br />	//for(iy=m; iy&lt;h-m; iy++){<br /><br />	 ...<br />	for(ix=m+1;ix&lt;w-m;ix++){<br />	 ...<br />	 }<br />}<br /><br />The result image is messed up...<br />If I use for(iy=m; iy&lt;h-m; iy++){ <br />and run the kernel with one single thread (that means there is no parallelization) everything is OK.<br /><br />Any ideas?<br /><br />]]></description>
   </item>
      <item>
      <title>Please Help! I have two display adapters...</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8086/please-help-i-have-two-display-adapters-</link>
      <pubDate>Fri, 11 May 2012 12:21:43 -0400</pubDate>
      <dc:creator>boomer88</dc:creator>
      <guid isPermaLink="false">8086@/devforum/discussions</guid>
      <description><![CDATA[Just bought a new Dell Inspiron 2320, it has integrated and dedicated video card. Under display adapters i see "Intel(R) HD Graphics Family" and "NVIDIA Geforce GT 525M", if i disable Intel driver, the Nvidia driver doesnt work. I need Nvidia to be responsible for my desktop display, as i cant calibrate colors properly with a crappy Intel HD controls. The colors look horrible and over saturated... i need it to be better for work. Thank.  ]]></description>
   </item>
      <item>
      <title>Linker error with c function in .cu file</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7881/linker-error-with-c-function-in-cu-file</link>
      <pubDate>Fri, 04 May 2012 16:08:40 -0400</pubDate>
      <dc:creator>basementscientist</dc:creator>
      <guid isPermaLink="false">7881@/devforum/discussions</guid>
      <description><![CDATA[I've created a kernel inside a .cu file. Also inside the .cu file is a c++ function that calls<br />the kernal. Everything compiles ok, but on the final linking, the c++ function is not visible to the rest of the program. How do I make the function visible?<br /><br />I am using Visual Studio 2010 on Windows 8, and the newest SDK and Toolkit.]]></description>
   </item>
      <item>
      <title>Visual C++ 2010 with PhysX SDK</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6816/visual-c-2010-with-physx-sdk</link>
      <pubDate>Mon, 09 Apr 2012 14:13:12 -0400</pubDate>
      <dc:creator>Kronus980</dc:creator>
      <guid isPermaLink="false">6816@/devforum/discussions</guid>
      <description><![CDATA[I was wondering if someone could help me set up the SDK with Visual C++ 2010? Having a lot of trouble doing this. Thanks!]]></description>
   </item>
      <item>
      <title>Brand new NVIDIA card!! What is the Error code: 3?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/8056/brand-new-nvidia-card-what-is-the-error-code-3</link>
      <pubDate>Thu, 10 May 2012 15:08:08 -0400</pubDate>
      <dc:creator>renia</dc:creator>
      <guid isPermaLink="false">8056@/devforum/discussions</guid>
      <description><![CDATA[Hello everyone,<br />I will try to be as short as possible.<br />I have a brand new NVIDIA GeForce 8400MG 256 MB and running Windows 7 Professional x64.<br />I am an architect and using AutoCAD, Rhino, Photoshop etc.<br />The problem is that everyday since I bought this display card my computer stops workings. I see a black screen for a few seconds and then the following message:<br /><br />"The NVIDIA OpenGL driver detected a problem with the display driver and is unable to continue. The application must close. <br />Error code: 3"<br /><br />This has happened when I was working in photoshop but also when I just had my internet browser open. I have updated my drivers and really don't know what else to do.<br /><br />Thank you in advance]]></description>
   </item>
      <item>
      <title>Quadro FX3700 reporting &quot;GDI Generic&quot; version 1.1.0 ?!?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7836/quadro-fx3700-reporting-gdi-generic-version-1-1-0-</link>
      <pubDate>Thu, 03 May 2012 15:17:21 -0400</pubDate>
      <dc:creator>bsenftner</dc:creator>
      <guid isPermaLink="false">7836@/devforum/discussions</guid>
      <description><![CDATA[I have a combo web server / hi-compute server inside a Windows Server 2008 R2 machine. The physical machine is in a collocation, and I access it via the web sites it serves and via Remote Desktop Commection (RDC) when installing, configuring &amp; maintaining the system.<br /><br />I have a few programs which attempt to create OpenGL contexts, render into the framebuffer, copy the results, and write them to an image file. However, the rendered results are black or noise filled images.<br /><br />I installed the "Geeks3D GPU Caps Viewer" (build 1.16.0 from April 11, 2012) and it is reporting, from a RDC connection, GL_RENDERER: "GDI Generic", GL_VERSION: 1.1.0, GLSL version: N.A.<br /><br />It looks like something additional, or extra is required to get Accelerated 3D Rendering in a headless context under Windows Server 2008 R2. Can anyone point out any information or references to help out?  ]]></description>
   </item>
      <item>
      <title>[PhysX 3.2 beta-3] Cannot convert binary data from 32 to 64 bit</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7551/physx-3-2-beta-3-cannot-convert-binary-data-from-32-to-64-bit</link>
      <pubDate>Wed, 25 Apr 2012 18:52:35 -0400</pubDate>
      <dc:creator>Budai</dc:creator>
      <guid isPermaLink="false">7551@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />I have been experiencing an access violation when I deserialize a repx file, convert it from 32 to 64 bit, and deserialize the result on a 64-bit platform.<br /><br />I have attached a vs2010 project as a test case.  You might have to check the platform toolset on x64, as I'm using vs express.<br /><br />Thank you for your time,<br />Budai]]></description>
   </item>
      <item>
      <title>nvstereo.h</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7991/nvstereo-h</link>
      <pubDate>Wed, 09 May 2012 17:41:39 -0400</pubDate>
      <dc:creator>colneilens</dc:creator>
      <guid isPermaLink="false">7991@/devforum/discussions</guid>
      <description><![CDATA[Is there an updated version of nvstereo.h for dx11? The one in SDK 11 only deals with DX9 and DX10?<br />Thanks,<br />Colin Neilens<br />Digital Extremes.]]></description>
   </item>
      <item>
      <title>GPU-aware hash cell size ???</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7961/gpu-aware-hash-cell-size-</link>
      <pubDate>Wed, 09 May 2012 03:01:19 -0400</pubDate>
      <dc:creator>man82bs</dc:creator>
      <guid isPermaLink="false">7961@/devforum/discussions</guid>
      <description><![CDATA[Hi guys..<br /><br />Recently I am working on a project which uses hash table.<br /><br />And I am trying to use gpu for computations on the data in each cell.<br /><br />My problem is which cell size gives me best performance.<br /><br />For example, if there are normalized 1M points which has uniform distribution, <br /><br />how much points should be included in a cell?? <br /><br />Generally, heuristic experiments decide the cell size but I really need ideas.<br /><br />If there are anyone who have an idea, help me plz...<br /><br />Thanks.]]></description>
   </item>
      <item>
      <title>MassFX warning with 3ds Max scenes installed with PhysX</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7946/massfx-warning-with-3ds-max-scenes-installed-with-physx</link>
      <pubDate>Tue, 08 May 2012 14:39:23 -0400</pubDate>
      <dc:creator>hotdoggyurkeyam</dc:creator>
      <guid isPermaLink="false">7946@/devforum/discussions</guid>
      <description><![CDATA[Hi, I get a warning when I open another max scene on another machine here at work, it says:<br /><br />“The MassFX authored in this file was created with a newer version of the plugins. Forward compatibility is not supported so things might not behave properly.”<br /><br />I am currently using PhysX for Max version: 2.7.1.0202.1800 and PhysX System Software 9.11.0621 abd I have noticed inside my Max install location, MassFX has been moved into a folder named "DISABLED_BY_NVIDIA_PHYSX_PLUGINS"<br /><br />Third party plugins installed for Max being used in the scene are: Rayfire 1.57.10; Orbaz Particle Flow Box <a href="/devforum/search?Search=%232&amp;Mode=like">#2</a> ver. 1.07 and <a href="/devforum/search?Search=%233&amp;Mode=like">#3</a> ver. 1.53. I’ve gotten this warning in scenes saved from another PC, opened in another, and not even using any PhysX or PhysX capable plugins.<br /><br />Thanks.]]></description>
   </item>
      <item>
      <title>Infrared(Nvidia) Box Office Set</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7896/infrarednvidia-box-office-set</link>
      <pubDate>Fri, 04 May 2012 23:40:24 -0400</pubDate>
      <dc:creator>joonkp1976</dc:creator>
      <guid isPermaLink="false">7896@/devforum/discussions</guid>
      <description><![CDATA[Infrared communications device attached to back of your video card, plus infrared receptor with 3 colored video and stereo audio input and/or more then 1 of these, for TV and the home theater sound systems, plus software suite that comes with the video card that installs and works with these and the last remote controller that works on TV infrared connector unit.  Call it 'Nvidia Box Office Set' and price it at the box set price of let us say 10 dollars for the remote controller unit and 10 dollars more for the software suite and 5 dollars for the convinience, making it an attractive buy and convinient solution to PC to TV media trasfer...  Software suite that works with all the media formats and remote controller unit that can pause, rewind, fast forward, stop, play, etc where the TV unit will send wire- less infrared signals to do these options through the video card's communications device and then through video card then to the software suite...  Nvidia could be the first to venture into the wire-less infrared Home Box Office Set business and it would have to be Nvidia if you ask me since Nvidia video cards chop less on MMORPGs and that is why I am a supporter of Nvidia...  :&gt;  I hate the Ati video cards and lag chops.  Really do.  Passionately do...  I currently have a system with Ati and next one for sure will be an Nvidia system that I promise myself everyday.  Anyways, if anyone was interested with my idea of Infrared Box Office Set, leave me a line and if anyone from Nvidia sees this an interest, also leave me a line.  Thanks a bunch people and enjoy great computing~! <br /> \(-_-)/]]></description>
   </item>
      <item>
      <title>CUDA integration in vs 11 beta.</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6771/cuda-integration-in-vs-11-beta-</link>
      <pubDate>Sat, 07 Apr 2012 19:04:21 -0400</pubDate>
      <dc:creator>m227</dc:creator>
      <guid isPermaLink="false">6771@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br />Sorry if this question was asked before but here it is: is there a way to integrate CUDA in vs 11 beta the same way it is in vs 2010?<br />Thanks,<br />G. ]]></description>
   </item>
      <item>
      <title>Which version of the NVAPI should I use with my Quadro FX 570 card?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7876/which-version-of-the-nvapi-should-i-use-with-my-quadro-fx-570-card</link>
      <pubDate>Fri, 04 May 2012 15:19:38 -0400</pubDate>
      <dc:creator>braggo</dc:creator>
      <guid isPermaLink="false">7876@/devforum/discussions</guid>
      <description><![CDATA[Dell desktop, Windows 7<br />Nvidia Quadro FX 570<br /><br />I am currently using R295 February 2012 release of the NVAPI and some of the functions are returning an error code -9 NVAPI_INCOMPATIBLE_STRUCT_VERSION.<br /><br />Specifcally, the NvAPI_DISP_SetDisplayConfig() and NvAPI_DISP_GetDisplayConfig() when I pass in a NV_DISPLAYCONFIG_PATH_INFO struct.<br /><br />Here are my debug print outs:<br /><br />NvAPI_Initialize(): mStatus = NVAPI_OK<br />STRUCT Versions:<br />NV_DISPLAY_DRIVER_VERSION_VER                  = 65676<br />NV_DISPLAY_PORT_INFO_VER                       = 65580<br />NV_DISPLAYCONFIG_PATH_INFO_VER                 = 131100<br />NV_DISPLAYCONFIG_PATH_ADVANCED_TARGET_INFO_VER = 65664<br />NvAPI_SYS_GetChipSetInfo(&amp;mChipSetInfo): mStatus = NVAPI_OK<br /> -- ChipSet Info --<br />Device Id      = 10720<br />HBdeviceId     = 10720<br />Vendor Name    = Intel<br />struct version = 262376<br />NvAPI_GetInterfaceVersionString(): mStatus = NVAPI_OK<br />NvAPI Version = NVidia Complete Version 1.10<br />NvAPI_SYS_GetDriverAndBranchVersion(): mStatus = NVAPI_OK<br />GPU Driver Version = 27533<br />Branch String = r275_21<br /><br />Is the GPU driver version incompatible with the API version macros?<br /><br />Thank you in advance for your help!]]></description>
   </item>
      <item>
      <title>Issue using NSight 2.2 rc2 with VS11</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7796/issue-using-nsight-2-2-rc2-with-vs11</link>
      <pubDate>Wed, 02 May 2012 15:19:27 -0400</pubDate>
      <dc:creator>diver182</dc:creator>
      <guid isPermaLink="false">7796@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />I installed VS11 (Ultimate) beta on my Win7 machine,<br />after that the 4.2x SDK and then Parallel NSight 2.2 rc2.<br />The NSight installer claimed to have made modifications to the VS11 installation.<br />But I can neither find the NSight menu on the upper pane nor the templates<br />for creating a cuda 4.2 project.<br /><br />Do I have to adjust anything to make it work or what did I miss?]]></description>
   </item>
      <item>
      <title>Cuda Kernels Stop Running After Few Iterations</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7381/cuda-kernels-stop-running-after-few-iterations</link>
      <pubDate>Sat, 21 Apr 2012 14:15:31 -0400</pubDate>
      <dc:creator>Eman</dc:creator>
      <guid isPermaLink="false">7381@/devforum/discussions</guid>
      <description><![CDATA[Hello,<br /><br />I am writing a code that calls a number of kernels inside a for loop. The number of the loop iterations is 1000. When I run the program, the kernels stop running after a number of iterations. I tried to use cudaGetLastError(); but it didn't give me any information as the output was "Error: unknown error". AS I increase the size of the blocks and the number of threads, the kernels stop running sooner. For example, when the block size is 8 it stopped at iteration 740, while when the size of the block is 16, it stopped at iteration 440.  In each iteration the same resources is being re-used so I really don't understand what is the problem!. <br /><br />Any help will be appreciated. <br /><br />Thanks, <br /> ]]></description>
   </item>
      <item>
      <title>GAI sample games don&#039;t work, GT240 CC1.2</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7826/gai-sample-games-dont-work-gt240-cc1-2</link>
      <pubDate>Thu, 03 May 2012 08:07:03 -0400</pubDate>
      <dc:creator>Pawe</dc:creator>
      <guid isPermaLink="false">7826@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br />I am working on parallel AI game-tree-search project and just wanted to run examples from NVIDIA GAI project as there is everything I would need. Unfortuanately sample applications do not work proparly on my PC (with CC=1.2, windows x64,  and latest drivers installed). Actually I can only run a game with CPU AI. When GPU is toggle only random player plays and always wins. I must be sure that these libraries work proparly so I can write similliar one for another game in my project. Please help!]]></description>
   </item>
      <item>
      <title>pfnIaSetIndexBuffer in NVIDIA display driver</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7821/pfniasetindexbuffer-in-nvidia-display-driver</link>
      <pubDate>Thu, 03 May 2012 05:46:50 -0400</pubDate>
      <dc:creator>gzMarshall</dc:creator>
      <guid isPermaLink="false">7821@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />I'm trying to render overlay in game via hooking the NVIDIA display driver. I hooked the pfnPresent and render the overlay in the function. When I called the NVIDIA's native function pfnIaSetIndexBuffer, it returns device removed error. The MSDN about the WDDM is too simple to debug. Does anyone in this forum know the issue?<br /><br />		D3D10DDIARG_CREATERESOURCE mtxResource;<br />		mtxResource.pInitialDataUP = NULL;<br />		mtxResource.ResourceDimension = D3D10DDIRESOURCE_BUFFER;<br />		mtxResource.Usage = D3D10_DDI_USAGE_DEFAULT;<br />		mtxResource.MapFlags = 0;<br />		mtxResource.MiscFlags = 0;<br />		mtxResource.Format = DXGI_FORMAT_UNKNOWN;<br />		mtxResource.SampleDesc.Count = 1;<br />		mtxResource.SampleDesc.Quality = 0;<br />		mtxResource.MipLevels = 1;<br />		mtxResource.ArraySize = 1;<br />		mtxResource.pPrimaryDesc = NULL;<br /><br />		D3D10DDI_MIPINFO indiciesSize;<br />		indiciesSize.PhysicalWidth = indiciesSize.TexelWidth = sizeof(ovlIndicies);<br />		indiciesSize.PhysicalDepth = indiciesSize.PhysicalHeight = indiciesSize.TexelDepth = indiciesSize.TexelHeight = 1;<br />		mtxResource.pMipInfoList = &amp;indiciesSize;<br /><br />		D3D10_DDIARG_SUBRESOURCE_UP indiciesInitialData;<br />		indiciesInitialData.pSysMem = ovlIndicies;<br />		indiciesInitialData.SysMemPitch = 0;<br />		indiciesInitialData.SysMemSlicePitch = 0;<br />		mtxResource.pInitialDataUP = &amp;indiciesInitialData;<br />	<br />		mtxResource.BindFlags = D3D10_DDI_BIND_INDEX_BUFFER;<br />		SIZE_T szIndicies = m_pDevice-&gt;m_deviceFuncs.pfnCalcPrivateResourceSize(m_pDevice-&gt;GetDrv(), &amp;mtxResource);<br />		m_hIndicies.pDrvPrivate = (void *)(new char[szIndicies]);<br /><br />		D3D10DDI_HRTRESOURCE hrtNull;<br />		hrtNull.handle = new char[szIndicies];<br /><br />		m_pDevice-&gt;m_deviceFuncs.pfnCreateResource(m_pDevice-&gt;GetDrv(), &amp;mtxResource, m_hIndicies, hrtNull);<br />                  <br />   m_pDevice-&gt;m_deviceFuncs.pfnIaSetIndexBuffer(m_pDevice-&gt;GetDrv(), m_hIndicies, DXGI_FORMAT_R32_UINT, 0);]]></description>
   </item>
      <item>
      <title>LNK2001 Undefined symbol not found</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7801/lnk2001-undefined-symbol-not-found</link>
      <pubDate>Wed, 02 May 2012 15:58:17 -0400</pubDate>
      <dc:creator>ronthompson</dc:creator>
      <guid isPermaLink="false">7801@/devforum/discussions</guid>
      <description><![CDATA[I'm using cuda 4.1 and Visual Studio 2008 C++. I have several routines in C that are called from my main program in C++. The C and CUDA routines are compiled with nvcc and the C++ is compiled with the compiler supplied in VS 2008.  It all compiles fine. It won't link because it can't find the routines in C.  I use the extern "C" with the C routines in the global area of my main. It does not help. I believe it is linking using the VS2008 linker and not nvcc.  What am I doing wrong?<br /><br />Ron]]></description>
   </item>
      <item>
      <title>Shared memory usage</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7761/shared-memory-usage</link>
      <pubDate>Wed, 02 May 2012 02:07:18 -0400</pubDate>
      <dc:creator>man82bs</dc:creator>
      <guid isPermaLink="false">7761@/devforum/discussions</guid>
      <description><![CDATA[Hi, I have a question about the basic usage for the shared memory.<br /><br />Each multiprocessor has own shared memory, right? <br />and my gpu has 48KB shared memory.<br /><br />Then, my question is, each multiprocessor has shared memory of 48KB or summation of 8 multiprocessor is 48KB.<br /><br />It's so confusing. <br /><br />Thanks in advance.<br /><br />cheers.]]></description>
   </item>
      <item>
      <title>HW Debug Support</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7751/hw-debug-support</link>
      <pubDate>Tue, 01 May 2012 19:41:18 -0400</pubDate>
      <dc:creator>Vector</dc:creator>
      <guid isPermaLink="false">7751@/devforum/discussions</guid>
      <description><![CDATA[Which gpus have hardware debug support to use with NSight 2.2's single gpu debug feature?<br />Thanks<br />]]></description>
   </item>
      <item>
      <title>2 GTX265 cards not detected for debugging</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6796/2-gtx265-cards-not-detected-for-debugging</link>
      <pubDate>Mon, 09 Apr 2012 01:11:30 -0400</pubDate>
      <dc:creator>vinaybgavirangaswamy</dc:creator>
      <guid isPermaLink="false">6796@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />Thank you for reading this and taking time to help me configure for debugging<br /><br />I am trying to do code line by line debugging on a system with 2 gtx 265 cards. I have tried to list my system configuration below if thinking it might help<br /><br />Mother board gigabyte 990fxa-ud3<br />graphics card gtx265 (one from zotech and other is from asus)<br />OS: Windows 7 (64 bit)<br />graphics driver: devdriver_4.1_winvista-win7_64_286.19_general<br />CUDE toolkit: cudatoolkit_4.1.28_win_64<br />parallel nsight: Parallel_Nsight_Win64_2.1.0.12046<br />SLI: disabled<br />WDDM TDR enabled: false<br />wpf hardware acceleration: disabled by running registry file in nsight common folder<br /><br />I am not able to make other graphics card as headless as I do not have that option in nvidia controller. I have attached few pictures of what I see in the controller screen<br /><br />Please help me as I am trying to use this for a course project...<br /><br />Thank you in advance!]]></description>
   </item>
      <item>
      <title>SOLVED Problem in Optix and HD graphics ? No srgb support</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7671/solved-problem-in-optix-and-hd-graphics-no-srgb-support</link>
      <pubDate>Sat, 28 Apr 2012 12:11:11 -0400</pubDate>
      <dc:creator>Conkal</dc:creator>
      <guid isPermaLink="false">7671@/devforum/discussions</guid>
      <description><![CDATA[Hello , <br /><br />I am programming a Bidirectional Path tracer in Optix and I have encountered a problem , which is also the same using the "Path tracer" sample from the Nvidia Optix Samples.<br /><br />(My computer is using intel i5-2500K (with iGpu intel HD-3000)and GTX560ti+ GTX460 , monitor is Samsung P2270, but the same happened in a couple of other computers. The drivers are the official ones with default parameters )<br /><br />The problem is this : <br /><br />1. When i connent my monitor to the GTX 560ti or GTX 460 graphics cards that i have, then the colors in the rendering are a lot brighter than when  connecting  the monitor to the iGPU of the processor. <br /><br />2. The result of the "printscreen function"(pressing s in the rendering window) is the same with the result in the igpu . <br /><br />What is happening ? Is it a bug from the sample code that copied by mistake or is it a problem with optix ? And more importantly WHICH RENDERING IS THE RIGHT ONE ?<br /><br /><br /><br />I have uploaded  the rendering of the default path tracer example and the same scene(almost) in mine.<br /><br />Each one has three pictures , printscreen using the gtx , another one using the intel HD and the optix output from the "s" command.]]></description>
   </item>
      <item>
      <title>Level of detail in Nsight profiling?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7621/level-of-detail-in-nsight-profiling</link>
      <pubDate>Thu, 26 Apr 2012 20:00:54 -0400</pubDate>
      <dc:creator>obuzko</dc:creator>
      <guid isPermaLink="false">7621@/devforum/discussions</guid>
      <description><![CDATA[Hi all,<br />I'm trying to optimize a fairly complex kernel that includes quite a few functions. Is Parallel Nsight capable of determining function-level performance bottlenecks within a kernel? <br />I'm using version 2.0 with VS2010 and a Tesla2050, and it appears to be limited to the kernel as a unit with no further breakdown. Can the latest version provide function-level profiling? If not, could someone suggest an alternative (if available)?<br />Thanks in advance<br /><br />Sasha<br />]]></description>
   </item>
      <item>
      <title>Running program in VS2010: Loading symbols from NVIDIA detoured.dll very slow</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7521/running-program-in-vs2010-loading-symbols-from-nvidia-detoured-dll-very-slow</link>
      <pubDate>Wed, 25 Apr 2012 09:41:02 -0400</pubDate>
      <dc:creator>smash85</dc:creator>
      <guid isPermaLink="false">7521@/devforum/discussions</guid>
      <description><![CDATA[I have a program which I am working on my Dell Laptop XPS 502X. At work, I usually have another monitor plugged in my DVI port and everything is fine. When I work from home, I only have the laptop screen. When I start the program in VS2010, it is very slow and I notice in the bottom of the screen Loading symbols for ...\NVIDIA Corporation\coprocmanager\detourel.dll... which takes around 10-20 seconds every time. Any idea on why this is happening ?<br />]]></description>
   </item>
      <item>
      <title>Origin of CUDA_ERROR_INVALID_IMAGE when calling cuModuleLoad - invalid nvcc call?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/5946/origin-of-cuda_error_invalid_image-when-calling-cumoduleload-invalid-nvcc-call</link>
      <pubDate>Thu, 15 Mar 2012 04:59:27 -0400</pubDate>
      <dc:creator>mkastrop</dc:creator>
      <guid isPermaLink="false">5946@/devforum/discussions</guid>
      <description><![CDATA[I always get the CUDA_ERROR_INVALID_IMAGE CUresult when calling cuModuleLoad on my .cubin-file. Even if I break it down to the following most simple kernel:<br /><br /><code><br />// minimal.cu<br /><br />__global__ void<br />minimal()<br />{<br />}<br /></code><br /><br />My compiler call looks like the following:<br /><br /><code><br />nvcc.exe -cubin -arch=sm_21 -o "minimal.cubin" "minimal.cu"<br /></code><br /><br />Its output looks good, doesn't it?<br /><br /><code><br />minimal.cu<br />minimal.cu<br />tmpxft_00000b38_00000000-3_minimal.cudafe1.gpu<br />tmpxft_00000b38_00000000-10_minimal.cudafe2.gpu<br /></code><br /><br />My system is a Win7 machine with one NVIDIA Quadro 600 in it. Can you please explain what I am doin' wrong? Unfortunately there are no concrete explanations to the CUDA error codes like I know them from OpenCL... That would be preferable.<br />]]></description>
   </item>
      <item>
      <title>286.16 driver crash in matlab running .ptx file</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6516/286-16-driver-crash-in-matlab-running-ptx-file</link>
      <pubDate>Wed, 28 Mar 2012 13:15:17 -0400</pubDate>
      <dc:creator>Bleiner</dc:creator>
      <guid isPermaLink="false">6516@/devforum/discussions</guid>
      <description><![CDATA[Hello everyone,<br /><br />I am attempting to integrate some cuda code into my matlab work using .ptx code.  The code I have written so far appears to work initially.  This code is getting run in a loop with the input data changing each time.  After the first run it appears to have the correct answer.  When it hits the second time the driver crashes, and my screen flickers on and off for a second.  It then lets me know that the driver crashed but it recovered.  Then as far as matlab is concerned the GPU is busy until I reboot matlab.  <br /><br />The only thing I can think of is that it doesn't like the fact that the various inputs and outputs are changing in size.  On each pass the size of the input and output arrays are getting smaller.  I tried clearing the data off the GPU using both clear and reset, but it doesn't seem to make a difference.<br /><br />Any thoughts?<br />Ben<br />]]></description>
   </item>
      <item>
      <title>Why is a GTX680 even slower than a GTX480 when using CUDA?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7181/why-is-a-gtx680-even-slower-than-a-gtx480-when-using-cuda</link>
      <pubDate>Tue, 17 Apr 2012 07:39:41 -0400</pubDate>
      <dc:creator>nepluno</dc:creator>
      <guid isPermaLink="false">7181@/devforum/discussions</guid>
      <description><![CDATA[I've tested several Apps in the GPU Computing SDK, such as the GrabCutNPP. Surprisingly I found the  GTX680 is even slower than my old GTX480 (about 0.9x). Why could this happen? In contrast, the test on 3DMark11 reported that the GTX680 is 2x faster.<br /><br />The installed driver is 301.10, with a CUDA Toolkit 4.26. My OS is Windows 7 SP1. I even compile the code using compute_30 and sm_30, but the result kept the same.<br /><br />ps: I couldn't find a developer version driver that supports GTX680.]]></description>
   </item>
      <item>
      <title>Cannot find Reduce1.sln</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7416/cannot-find-reduce1-sln</link>
      <pubDate>Mon, 23 Apr 2012 05:21:04 -0400</pubDate>
      <dc:creator>celebisait</dc:creator>
      <guid isPermaLink="false">7416@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />I'm new at GPU programming and CUDA. I read the CUDA C Programming Guide. It was very helpful for me. And now I'm reading that tutorial* from Cyril Zeller. However, it says "Open up reduce\src\reduce1.sln" on the page 36/157, and I couldn't find that visual studio solution file.<br /><br />I have NVIDIA GPU Computing SDK 4.1. I searched in the SDK and found something at:<br /><br />"C:\ProgramData\NVIDIA Corporation\NVIDIA GPU Computing SDK 4.1\C\src\reduction"<br /><br />but I'm not sure is that the same thing with the PDF because it doesn't have the solution files separate like reduce1.sln, reduce2.sln etc.<br /><br />I will be appreciated for any help,<br />Sait.<br /><br />*<a href="http://people.maths.ox.ac.uk/~gilesm/hpc/NVIDIA/NVIDIA_CUDA_Tutorial_No_NDA_Apr08.pdf" target="_blank" rel="nofollow">http://people.maths.ox.ac.uk/~gilesm/hpc/NVIDIA/NVIDIA_CUDA_Tutorial_No_NDA_Apr08.pdf</a>]]></description>
   </item>
      <item>
      <title>PBO + glTexSubImage2D hangs</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7491/pbo-gltexsubimage2d-hangs</link>
      <pubDate>Tue, 24 Apr 2012 18:02:43 -0400</pubDate>
      <dc:creator>attilaafra</dc:creator>
      <guid isPermaLink="false">7491@/devforum/discussions</guid>
      <description><![CDATA[I'm uploading frames into a texture using a pixel buffer object. I'm using an NVS 5100M and Windows 7 x64. With driver version 275.33 or older it works fine, but with 296.10, 296.35 and 301.27 the glTexSubImage2D call hangs for a few seconds. Sometimes it recovers, but usually TDR kicks in or even causes a BSOD. The problem is not specific to my code, and it can be easily reproduced using this PBO sample code: <a href="http://www.songho.ca/opengl/files/pboUnpack.zip" target="_blank" rel="nofollow">http://www.songho.ca/opengl/files/pboUnpack.zip</a><br /><br />Do you have any idea what's wrong? Many thanks!]]></description>
   </item>
      <item>
      <title>PhysX Lab unable to open .apb files</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7456/physx-lab-unable-to-open-apb-files</link>
      <pubDate>Tue, 24 Apr 2012 09:55:49 -0400</pubDate>
      <dc:creator>dgermain</dc:creator>
      <guid isPermaLink="false">7456@/devforum/discussions</guid>
      <description><![CDATA[I can import .obj files, save them as .apb, use them in the sample demo without much problems.<br /><br />However I'm unable to open/import .apb files, neither the files that used in  the SimpleDestruction demo nor the files I created/exported with physx Lab.<br /><br />When I do an 'import 3d Mesh' and try opening a .apb file, I receive no error, but nothing happen in the editor as if I did not do anything.<br /><br />Is that a known issue, or I am the only one experiencing this problem ?<br /><br />Thanks,<br />DG<br />]]></description>
   </item>
      <item>
      <title>nppiHistogramRange_16u_C1R really slow</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/5301/nppihistogramrange_16u_c1r-really-slow</link>
      <pubDate>Thu, 01 Mar 2012 03:20:57 -0500</pubDate>
      <dc:creator>candreoli</dc:creator>
      <guid isPermaLink="false">5301@/devforum/discussions</guid>
      <description><![CDATA[I'm  trying to migrate from my IPP based  program to a NPP based program....i have to get the histogram of a 16 bit unsigned images but the function nppiHistogramRange_16u_C1R is really slow....for me it's faster to<br /><br />1) convert the image to 32f<br /><br />2) calculate the histogram using nppiHistogramRange_32f_C1R<br />and go on with my code....<br /><br />There is maybe some issue to fix?<br /><br /><br /><br />Thanks in advance, Carlo<br />]]></description>
   </item>
      <item>
      <title>Crash with the new LLVM compiler</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6606/crash-with-the-new-llvm-compiler</link>
      <pubDate>Mon, 02 Apr 2012 06:41:37 -0400</pubDate>
      <dc:creator>Tofic</dc:creator>
      <guid isPermaLink="false">6606@/devforum/discussions</guid>
      <description><![CDATA[Hi,<br /><br />  OpenCL 1.1, drivers 296.10, GTX 580, 64-bits compiler, Windows 7 64-bits.<br /><br />  100% crash inside the compiler, when trying to compile this construction (well-compilable with the old compiler):<br /><strong>const struct BBox bbox = { (float4)(-.5f,-.5f,-.5f,0), (float4)(.5f,.5f,.5f,0) };<br /><br />	....</strong><br /><br />  Error: <em>OpenCL error 'Invalid binary': compilation error<br />	 ptxas application ptx input, line 13; error : Module-scoped variables in .local state space are not allowed with ABI</em><br /><br />       or<br /><br /><em>UNREACHABLE executed.</em><br /><br /><br />  Fix: remove the "const" modifier. Started with new LLVM compiler.<br /><br />Best wishes,<br />Anton]]></description>
   </item>
      <item>
      <title>openCL equivalent for &quot;cudaMallocPitch&quot;..?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/7446/opencl-equivalent-for-cudamallocpitch-</link>
      <pubDate>Tue, 24 Apr 2012 03:01:59 -0400</pubDate>
      <dc:creator>HariniS</dc:creator>
      <guid isPermaLink="false">7446@/devforum/discussions</guid>
      <description><![CDATA[Hi every one,<br />I am very new to openCL and strated converting on eof the cuda code to opencl but got stuck in this part of code.My PC is having AMD processor with ATI graphics card which doesnt support openCL, but other codes are running fine here by falling back to CPU. someone please give me some suggestion so that i can get rid of this problem. the cuda code is<br /><br />size_t pitch = 0;	<br />	cudaError error = cudaMallocPitch( (void**)&amp;gpu_data, (size_t*)&amp;pitch, instances-&gt;cols * sizeof(float), instances-&gt;rows);	<br /><br />for( int i = 0; i &lt; instances-&gt;rows; i++ ){	<br />		///error = cudaMemcpy((void*)(gpu_data + (pitch/sizeof(float))*i), (void*)(instances-&gt;data + (instances-&gt;cols*i)), instances-&gt;cols * sizeof(float) ,cudaMemcpyHostToDevice);	<br /><br /><br />My converted openCL code is<br /><br />    gpu_data = clCreateBuffer(context, CL_MEM_READ_WRITE, ((instances-&gt;cols)*(instances-&gt;rows))*sizeof(float), NULL, &amp;ret);<br /><br />for( int i = 0; i &lt; instances-&gt;rows; i++ ){	<br />   ret = clEnqueueWriteBuffer(command_queue, gpu_data , CL_TRUE, 0, ((instances-&gt;cols)*(instances-&gt;rows))*sizeof(float),(void*)(instances-&gt;data + (instances-&gt;cols*i)) , 0, NULL, NULL);<br />if(ret != CL_SUCCESS)<br />	break;<br />}<br /><br /><br />some times it works fine but some times it stucks. But every time it stucks in reading part i.e.<br /><br />ret = clEnqueueReadBuffer(command_queue, gpu_data, CL_TRUE, 0,sizeof( float ) * instances-&gt;cols* 1 , instances-&gt;data, 0, NULL, NULL);<br /><br />where "gpu_data" is a device memory of type cl_mem and "instances" is a "matrix".<br />For both case its giving <br />Unhandled exception at 0x10001098 in CL_kmeans.exe: 0xC000001D: Illegal Instruction.<br />when break is pressed<br />No symbols are loaded for any call stack frame. The source code cannot be displayed.<br /><br />and in Call stack,<br /><br />&gt;	OCL8CA9.tmp.dll!10001098() 	<br /> 	[Frames below may be incorrect and/or missing, no symbols loaded for OCL8CA9.tmp.dll]	<br /> 	amdocl.dll!5c39de16() 	<br /><br />is displayed. Someone please help me in getting out of this.<br />Thanks in advance]]></description>
   </item>
      <item>
      <title>Issues with using CUDA specifically on a version of Linux, Mac or a particular Windows release?</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/191/issues-with-using-cuda-specifically-on-a-version-of-linux-mac-or-a-particular-windows-release</link>
      <pubDate>Mon, 29 Aug 2011 18:06:16 -0400</pubDate>
      <dc:creator>Nadeem Mohammad</dc:creator>
      <guid isPermaLink="false">191@/devforum/discussions</guid>
      <description><![CDATA[We have a comprehensive QA procedure to test all our supported configurations - but sometimes you may need to tweak your installation or there even some bugs or issues. Use these forums with the correct TAGs to ask a question or share some ideas.]]></description>
   </item>
      <item>
      <title>PhysX SDK 3.2: Physx Visual indicator shows PHYSX&gt;CPU</title>
      <link>http://forums.developer.nvidia.com/devforum/discussion/6956/physx-sdk-3-2-physx-visual-indicator-shows-physxcpu</link>
      <pubDate>Thu, 12 Apr 2012 08:32:16 -0400</pubDate>
      <dc:creator>buzzfan</dc:creator>
      <guid isPermaLink="false">6956@/devforum/discussions</guid>
      <description><![CDATA[This almost feels like a dumb question because I can't seem to find a definitive answer. I'm using PhysX SDK 3.2 and have a single GTX550Ti using PhysX v9.12 and 296.10 drivers. When I run either my own code or the SDK samples I see PHYSX&gt;CPU on the top left. I seem to recall that it used to show PHYSX&gt;GPU at some point in the past. Can someone tell me whether I should expect to see PhysX running on the GPU and if so, any ideas why it might not be working (with the 3.2 SDK samples at least).<br /><br />Thanks, Mark]]></description>
   </item>
      </channel>
</rss>
