What is meant by linearization of a BVH and bucket count for SAH?

I am working on a CUDA ray tracer and doing some research on it, but I am unable to find answers to a few important questions:

What is the purpose of linearizing a BVH during ray tracing?
What is meant by bucket count for SAH (surface area heuristic) binning …
What is meant by binning through the centroid of a primitive?