VFP & NEON on TK1

sanket.kanzarkar · July 2, 2018, 5:21am

I have recently started working arm cortex-a15 on board Jetson TK1. Where I have to optimize code. As on my algorithm i am getting following results.

Running on 1Ghz freq 5-6 fps & Running on 2GHz freq 10-11 fps

But i want to optimize it upto minimum 18-19 fps at 1GHZ. I have read lot of things related to VFP & NEON. And i found VFP is not a parallel architecture like NEON.Even Mixing NEON and VFP instructions will give poor performance.

And in my algorithm most of the functionality is sequentially dependent. i.e, they are interdependent.

I have not implemented VFP or NEON in my algo. I want to know how could i use VFP and NEON in my algorithm to optimize it.

kayccc · July 2, 2018, 5:31am

Hi sanket, kanzarkar,

Not sure the details of your algorithm, but please check below topic if it’s helpful: [url]https://devtalk.nvidia.com/default/topic/1021997/[/url]

Thanks

sanket.kanzarkar · July 2, 2018, 5:58am

can you tell me g++ options for tk1 in order to properly use VFP or NEON

linuxdev · July 2, 2018, 6:42pm

I don’t know about VFP, but the NEON with hard float calling convention is:

-march=armv7-a -mfpu=neon -mfloat-abi=hard

If you are compiling natively you won’t need the armv7-a, nor the mfloat-abi since this is the default. The “-mfpu-neon” is the one which makes the NEON available, but you still have to use NEON in your code.

EDIT: Just noticed you are talking about hardware floating point when you used VFP acronym (I sometimes suffer from “acronym psychosis”). In ARMv7-a the older hardware didn’t support a hardware floating point and it was soft (“software”) floating point. If you were to install a cross compiler on a PC you’d look for the “armhf” in the compiler name, and if the compiler has that, then it is able to use hardware floating point instructions (it’s a calling convention on how return values are used related to using software methods or the hardware floating point unit). You won’t need a separate command line argument for the compiler to use or recognize hardware floating point if the compiler itself is correct. For naming purposes “armhf” is the “E-ABI” calling convention using “hardware floating point”. This is what the compilers are in the TK1 and the one installed on host if you ran JetPack for the host itself. The Ubuntu cross arch arm32 compilers also have armhf available (and if you install to host via JetPack this is likely part of what you will get).

One more edit…you can explore options with “man gcc” or “man g++”. The “/” key searches for regular expression terms in man pages, so for example you can “/float-abi”.

Topic		Replies	Views
compilation command for vfpv4 on tk1 Jetson TK1	1	471	July 2, 2018
How can I get high computer speed of Jetson TK1 when using arm neon Jetson TK1	8	1903	October 18, 2021
GCC options for Tegra K1 Jetson TK1	1	1621	July 18, 2016
Question about NEON and VFP on the TX2 platform Jetson TX2	1	479	August 27, 2018
ARM NEON support Legacy PGI Compilers	8	5901	June 13, 2012
ARM Assembler on Jetson TK1 Jetson TK1	4	3845	December 17, 2014
How are floating point operations handled on NVIDIA Jetson TK1? Jetson TK1	3	2200	July 22, 2015
nvcc compilation problem Jetson TK1	4	1583	March 14, 2016
How to using VFPv4 like NEON intrinsics on Jetson TX2 ? Jetson TX2	3	1464	August 29, 2018
Cross-compile using NEON and floating point Jetson TX2	5	1963	October 18, 2021

VFP & NEON on TK1

Related topics