Accelerating Large Language Models with Habana Gaudi2