Right now it is not clear what causes the long runtime, so there is no way to tell whether any parameter setting would help.
I don't see how partitioning would help here. After all, it is one single hierarchy that gets built up, and there is no way to know up front which parts of the hierarchy are stored in which partition.
Did you check CPU/memory utilization during the HierarchyCreate?