* Make HF implementation match OLMo models for lower precisions * Add test of 1B logits in bfloat16 * Run make fixup