INTRODUCTION TO INFERENCE TECHNIQUESPROMPT ENGINEERINGCACHING WITH VECTOR STORESCHAINS FOR LONG DOCUMENTSSUMMARIZATIONBATCH PROMPTING FOR EFFICIENT INFERENCEMODEL OPTIMIZATION METHODSPARAMETER‐EFFICIENT FINE‐TUNING METHODSCOST AND PERFORMANCE IMPLICATIONSSUMMARYREFERENCES