Build & Train Custom NLP Models with Transformers & Python, 2026


Build & train custom NLP models with cutting-edge Transformers and Python. This 2026 guide empowers engineers to develop robust natural language solutions.


Carlos Carvajal Fiamengo

January 17, 2026

20 min read

The escalating complexity of unstructured data across enterprise ecosystems presents a formidable challenge for generic Natural Language Processing (NLP) models. While off-the-shelf solutions offer immediate utility, their inherent lack of domain-specific nuance often leads to suboptimal performance, inaccurate insights, and missed opportunities in critical business functions like financial analysis, legal discovery, and specialized healthcare diagnostics. Custom NLP models, built upon the adaptive power of Transformer architectures and meticulously fine-tuned for precise domain understanding, are no longer a luxury but a strategic imperative. In 2026, organizations unable to leverage this capability face significant competitive disadvantages, risking misinterpretation of critical information and delayed, less effective decision-making. This article will equip you with the advanced knowledge and practical methodology to design, build, and train robust custom NLP models using the latest Transformer architectures and Python, ensuring your solutions are not just functional, but truly transformative.


Technical Fundamentals: Architecting Semantic Mastery

At the heart of modern NLP lies the Transformer architecture, a paradigm shift introduced in 2017 that fundamentally altered how machines process sequential data. Unlike its predecessors (RNNs, LSTMs), the Transformer jettisoned recurrence in favor of self-attention mechanisms, enabling parallel processing of input sequences and capturing long-range dependencies with unprecedented efficiency.

The Attention Mechanism: The Core of Contextual Understanding

The Attention mechanism allows a model to weigh the importance of different words in an input sequence when encoding a particular word. In a sentence like "The quick brown fox jumped over the lazy dog," when processing "fox," the model might pay more attention to "brown" and "quick" to understand its specific characteristics, and to "jumped" to understand its action.

More specifically, Self-Attention computes three vectors for each token in the input: a Query (Q), a Key (K), and a Value (V).

  • Query: Represents the current token being processed, asking "What information do I need from other tokens?"
  • Key: Represents the information content of all other tokens, answering "What information do I offer?"
  • Value: Represents the actual content to be aggregated if a token's key matches the query.

The attention score is computed by taking the dot product of the Query with all Keys, scaling the result by the square root of the key dimension (so that large dot products do not push the softmax into regions with vanishing gradients), and applying a softmax function. This produces a distribution of weights. These weights are then multiplied by the Value vectors and summed, yielding a contextualized representation of the token.
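
To make this concrete, here is a minimal PyTorch sketch of single-head scaled dot-product attention. The tensor shapes and random inputs are illustrative assumptions for the example, not taken from any particular model.

import torch
import torch.nn.functional as F

def scaled_dot_product_attention(Q, K, V):
    """Q, K, V have shape (batch, seq_len, d_k)."""
    d_k = Q.size(-1)
    # Dot product of each Query with every Key, scaled by sqrt(d_k)
    scores = Q @ K.transpose(-2, -1) / (d_k ** 0.5)
    # Softmax over the keys yields the attention weight distribution
    weights = F.softmax(scores, dim=-1)
    # Weighted sum of the Value vectors -> contextualized token representations
    return weights @ V, weights

# Toy example: 1 sentence, 9 tokens, 64-dimensional projections
Q = torch.randn(1, 9, 64)
K = torch.randn(1, 9, 64)
V = torch.randn(1, 9, 64)
context, attn = scaled_dot_product_attention(Q, K, V)
print(context.shape, attn.shape)  # torch.Size([1, 9, 64]) torch.Size([1, 9, 9])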

Multi-Head Attention extends this by allowing the model to jointly attend to information from different representation subspaces at different positions. Instead of performing a single attention function with one set of Q, K, V matrices, it performs multiple (e.g., 8 or 12) such operations in parallel. Each "head" learns a different aspect of relationships between words, and their outputs are concatenated and linearly transformed, providing a richer and more diverse contextual understanding. This parallel processing capability is a cornerstone of the Transformer's power, enabling it to model complex semantic relationships efficiently.
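
PyTorch ships a ready-made multi-head attention layer, so the parallel heads do not have to be implemented by hand. The short sketch below assumes illustrative sizes (a 768-dimensional embedding split across 12 heads) and is not tied to any specific pre-trained model.

import torch
import torch.nn as nn

embed_dim, num_heads, seq_len = 768, 12, 9
mha = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

x = torch.randn(1, seq_len, embed_dim)  # a batch of token embeddings
# Self-attention: the same sequence supplies the queries, keys, and values
output, attn_weights = mha(x, x, x)
print(output.shape)        # torch.Size([1, 9, 768])
print(attn_weights.shape)  # torch.Size([1, 9, 9]), averaged over the 12 heads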

From Pre-training to Precision: The Customization Spectrum

The development lifecycle of custom Transformer models typically involves three distinct, yet often interconnected, strategies:

  1. Pre-training from Scratch: This involves training a Transformer model from randomly initialized weights on a massive, unlabeled dataset (often billions of tokens). The objective is to learn general language understanding, typically by predicting masked words (Masked Language Modeling, MLM), the next sentence (Next Sentence Prediction, NSP), or, in decoder-style models, the next token (causal language modeling). This is prohibitively expensive, requiring vast compute resources (e.g., hundreds of H100 GPUs for weeks), and is typically reserved for pioneering research, developing foundational models (e.g., Llama 3, GPT-4 derivatives), or highly unusual languages or codebases for which no suitable pre-trained models exist.

  2. Fine-tuning a General-Purpose Foundation Model: The most common and effective strategy. A pre-trained model (e.g., a variant of BERT, RoBERTa, Llama, Mistral) that has already learned general language representations on vast web data is adapted to a specific downstream task (e.g., sentiment analysis, named entity recognition) using a smaller, task-specific labeled dataset. This leverages the transfer learning paradigm, benefiting from the pre-trained model's extensive knowledge while specializing it for a particular domain or task. The computational cost is significantly lower than pre-training from scratch.

  3. Fine-tuning a Domain-Specific Pre-trained Model: An evolution of the second strategy. Here, the base model is not just generally pre-trained, but specifically pre-trained on a large corpus from a target domain (e.g., BioBERT for biomedical text, FinBERT for financial documents). This provides an even stronger starting point, as the model already understands the jargon, nuances, and common patterns within that specific domain, leading to superior performance with less fine-tuning data required for the final task.

The Rise of Parameter-Efficient Fine-Tuning (PEFT)

By 2026, Parameter-Efficient Fine-Tuning (PEFT) techniques have become indispensable for fine-tuning large language models (LLMs) with constrained computational resources. Traditional fine-tuning modifies all parameters of a model, which can be millions or billions, making it memory-intensive and slow. PEFT methods, such as LoRA (Low-Rank Adaptation), QLoRA (Quantized LoRA), and Adapter tuning, introduce a small number of new, trainable parameters while keeping the vast majority of the pre-trained model's weights frozen.

  • LoRA: Inserts small, trainable rank-decomposition matrices into the Transformer's attention layers. Instead of updating the full weight matrices, only these much smaller rank-decomposition matrices are trained. This drastically reduces the number of trainable parameters, leading to faster training, a lower memory footprint, and the ability to store multiple fine-tuned versions of a model by saving only the small LoRA weights (see the sketch just after this list).
  • QLoRA: Extends LoRA by quantizing the pre-trained model to the 4-bit NormalFloat (NF4) data type and using a double quantization technique, allowing even larger models to be fine-tuned on consumer-grade GPUs without significant performance degradation (a loading sketch follows at the end of this subsection).
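
The following is a minimal conceptual sketch of the LoRA update for a single 768×768 projection, assuming rank r=8 and scaling alpha=16. It illustrates the idea only; it is not how the peft library implements LoRA internally.

import torch
import torch.nn as nn

d, r, alpha = 768, 8, 16
W = nn.Linear(d, d, bias=False)              # frozen pre-trained projection
W.weight.requires_grad_(False)

A = nn.Parameter(torch.randn(r, d) * 0.01)   # trainable low-rank factor (r x d)
B = nn.Parameter(torch.zeros(d, r))          # trainable low-rank factor (d x r), zero-initialized

def lora_forward(x):
    # Frozen projection plus the scaled low-rank update: W x + (alpha / r) * B A x
    return W(x) + (alpha / r) * (x @ A.t() @ B.t())

print(f"Full weight parameters: {d * d:,}")          # 589,824
print(f"Trainable LoRA parameters: {2 * r * d:,}")   # 12,288 (~2% of the full matrix)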

These PEFT methods make state-of-the-art LLMs accessible for custom development, democratizing advanced NLP capabilities for smaller teams and specialized applications.
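
For QLoRA specifically, the base model is loaded in 4-bit NF4 before the adapters are attached. The sketch below shows one plausible configuration using the transformers BitsAndBytesConfig; it assumes the bitsandbytes package and a CUDA GPU are available, and it reuses the distilbert-base-uncased checkpoint from the next section purely for consistency (in practice QLoRA is applied to multi-billion-parameter models).

import torch
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                       # load weights in 4-bit precision
    bnb_4bit_quant_type="nf4",               # 4-bit NormalFloat quantization
    bnb_4bit_use_double_quant=True,          # quantize the quantization constants as well
    bnb_4bit_compute_dtype=torch.bfloat16,   # computations still run in bfloat16
)

quantized_model = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=3, quantization_config=bnb_config
)
# LoRA adapters are then attached on top of the frozen, quantized weights,
# exactly as shown with get_peft_model() in the walkthrough below.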


Practical Implementation: Building a Custom Financial Sentiment Analyzer with PEFT

We will demonstrate how to fine-tune a pre-trained Transformer model for financial sentiment analysis using the Hugging Face Transformers library and PEFT (LoRA). This will allow us to classify financial news headlines into Positive, Neutral, or Negative categories.

Prerequisites: Ensure you have Python 3.10+ and the following libraries installed:

pip install torch transformers datasets peft accelerate scikit-learn pandas

import torch
import pandas as pd
import numpy as np
from datasets import Dataset # Hugging Face dataset
from transformers import AutoTokenizer, AutoModelForSequenceClassification, TrainingArguments, Trainer
from peft import LoraConfig, get_peft_model, TaskType # Key PEFT imports
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Ensure consistent device usage
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"Using device: {device}")

# --- 1. Data Preparation ---
# For this example, we'll create a synthetic financial dataset.
# In a real-world scenario, you would load your domain-specific CSV/JSON data.

# Synthetic data for demonstration purposes (each class has at least two examples
# so that the stratified train/validation split below works)
data = {
    'text': [
        "Tesla stock surged after record Q1 earnings beat analyst expectations.",
        "Market remains cautious amidst rising inflation concerns and supply chain disruptions.",
        "Alphabet announced robust growth in cloud services, boosting investor confidence.",
        "Oil prices fell sharply due to unexpected production increases from OPEC+.",
        "Company X reported earnings in line with forecasts, no significant movement.",
        "New regulatory framework could impact tech giants' revenue streams.",
        "Acquisition of smaller biotech firm promises future innovation and market expansion.",
        "Persistent geopolitical tensions continue to weigh on global trade outlook.",
        "Revenue projections lowered for next quarter due to decreased consumer spending.",
        "Innovative new product launch by startup Y draws significant venture capital interest.",
        "Central bank held interest rates steady, matching consensus forecasts.",
        "Trading volumes were flat this week with no notable sector rotation."
    ],
    'label': [
        'positive', 'negative', 'positive', 'negative', 'neutral',
        'negative', 'positive', 'negative', 'negative', 'positive',
        'neutral', 'neutral'
    ]
}
df = pd.DataFrame(data)

# Map labels to integers
label_to_id = {'negative': 0, 'neutral': 1, 'positive': 2}
id_to_label = {0: 'negative', 1: 'neutral', 2: 'positive'}
df['labels'] = df['label'].map(label_to_id)

# Split data into training and validation sets
train_df, val_df = train_test_split(df, test_size=0.2, random_state=42, stratify=df['labels'])

# Convert pandas DataFrames to Hugging Face Dataset objects
train_dataset = Dataset.from_pandas(train_df[['text', 'labels']])
val_dataset = Dataset.from_pandas(val_df[['text', 'labels']])

# --- 2. Tokenization ---
# We'll use a robust, general-purpose model's tokenizer for broad applicability.
# For financial data, a domain-specific tokenizer (if available for your chosen model)
# like 'ProsusAI/finbert' might be even better, but 'distilbert-base-uncased' is widely used.
model_checkpoint = "distilbert-base-uncased" # A good lightweight starting point
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)

def tokenize_function(examples):
    """Tokenizes input text and truncates/pads for consistency."""
    return tokenizer(examples['text'], truncation=True, padding='max_length', max_length=128)

# Apply tokenization to datasets
tokenized_train_dataset = train_dataset.map(tokenize_function, batched=True)
tokenized_val_dataset = val_dataset.map(tokenize_function, batched=True)

# Drop columns the model doesn't need; keep 'input_ids', 'attention_mask', and 'labels'.
# The string 'label' column was already excluded when building the Dataset, and
# '__index_level_0__' is the pandas index carried over by Dataset.from_pandas.
tokenized_train_dataset = tokenized_train_dataset.remove_columns(["text", "__index_level_0__"])
tokenized_val_dataset = tokenized_val_dataset.remove_columns(["text", "__index_level_0__"])

# --- 3. Model Loading and LoRA Configuration ---
num_labels = len(label_to_id)

# Load the base model with a sequence classification head
model = AutoModelForSequenceClassification.from_pretrained(
    model_checkpoint,
    num_labels=num_labels,
    id2label=id_to_label,
    label2id=label_to_id
).to(device)

# Configure LoRA: This is where PEFT comes in.
# We target the query and value matrices in the attention mechanism.
lora_config = LoraConfig(
    r=8, # Rank of the update matrices. Lower rank means fewer trainable parameters.
    lora_alpha=16, # Scaling factor for the LoRA updates.
    target_modules=["q_lin", "v_lin"], # Modules to apply LoRA to (query and value linear layers)
    lora_dropout=0.1, # Dropout probability for LoRA layers
    bias="none", # Type of bias to use (none, all, lora_only)
    task_type=TaskType.SEQ_CLS # Specify the task type
)

# Apply LoRA to the base model
# This wraps the model, adding the small, trainable LoRA layers
model = get_peft_model(model, lora_config).to(device)

# Print trainable parameters to see the drastic reduction
print("\n--- Trainable Parameters with LoRA ---")
model.print_trainable_parameters() # This will show a small percentage of total parameters are trainable.

# --- 4. Training Arguments ---
training_args = TrainingArguments(
    output_dir="./results_2026", # Directory to save checkpoints and logs
    num_train_epochs=5, # Number of epochs to train for
    per_device_train_batch_size=8, # Batch size per GPU/CPU for training
    per_device_eval_batch_size=8, # Batch size per GPU/CPU for evaluation
    warmup_steps=50, # Number of steps for linear warmup
    weight_decay=0.01, # Strength of weight decay
    logging_dir="./logs_2026", # Directory for storing logs
    logging_steps=10, # Log every N update steps
    eval_strategy="epoch", # Evaluate every epoch (named 'evaluation_strategy' in older transformers releases)
    save_strategy="epoch", # Save checkpoint every epoch
    load_best_model_at_end=True, # Load the best model found during training
    metric_for_best_model="f1", # Metric to use for early stopping and best model selection
    greater_is_better=True, # For F1, higher is better
    fp16=torch.cuda.is_available(), # Enable mixed precision training if CUDA is available
    report_to="none" # Disable reporting to external services like W&B for simplicity
)

# --- 5. Define Metrics for Evaluation ---
def compute_metrics(p):
    """Computes F1, precision, recall, and accuracy for evaluation."""
    predictions, labels = p
    predictions = np.argmax(predictions, axis=1)
    precision, recall, f1, _ = precision_recall_fscore_support(labels, predictions, average='weighted')
    acc = accuracy_score(labels, predictions)
    return {
        'accuracy': acc,
        'f1': f1,
        'precision': precision,
        'recall': recall
    }

# --- 6. Initialize and Train the Trainer ---
trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=tokenized_train_dataset,
    eval_dataset=tokenized_val_dataset,
    tokenizer=tokenizer,
    compute_metrics=compute_metrics
)

print("\n--- Starting Model Training (LoRA Fine-tuning) ---")
trainer.train()

# --- 7. Save the Fine-tuned LoRA Adapters ---
# Only save the LoRA adapters, not the full model, for efficiency
model.save_pretrained("./my_custom_financial_sentiment_model_2026_lora")
tokenizer.save_pretrained("./my_custom_financial_sentiment_model_2026_lora")
print("\n--- LoRA adapters and tokenizer saved successfully ---")

# --- 8. Inference with the Fine-tuned Model ---
print("\n--- Demonstrating Inference ---")
from peft import PeftModel # To load the LoRA adapters

# Load the base model (unmodified)
base_model = AutoModelForSequenceClassification.from_pretrained(
    model_checkpoint,
    num_labels=num_labels,
    id2label=id_to_label,
    label2id=label_to_id
).to(device)

# Load the PEFT model by adding the LoRA adapters to the base model
loaded_model = PeftModel.from_pretrained(base_model, "./my_custom_financial_sentiment_model_2026_lora").to(device)
loaded_tokenizer = AutoTokenizer.from_pretrained("./my_custom_financial_sentiment_model_2026_lora")

# Example text for inference
test_texts = [
    "Economic downturn expected as central bank raises interest rates.",
    "Company X stock price soared on promising clinical trial results.",
    "Quarterly earnings met expectations, no significant market reaction."
]

for text in test_texts:
    inputs = loaded_tokenizer(text, return_tensors="pt", truncation=True, padding=True).to(device)
    with torch.no_grad():
        logits = loaded_model(**inputs).logits
    predictions = torch.argmax(logits, dim=-1).item()
    predicted_label = id_to_label[predictions]
    print(f"Text: '{text}' -> Predicted Sentiment: {predicted_label}")

Explanation of Key Code Segments:

  • Dataset.from_pandas: The Hugging Face datasets library provides a highly efficient way to manage and process data. Converting pandas DataFrames to Dataset objects allows seamless integration with Trainer.
  • AutoTokenizer.from_pretrained(model_checkpoint): This loads the tokenizer associated with distilbert-base-uncased. The tokenizer is crucial for converting raw text into numerical input_ids and attention_mask suitable for the Transformer model.
  • AutoModelForSequenceClassification.from_pretrained(...): Loads the pre-trained DistilBERT model. The ForSequenceClassification variant automatically adds a classification head on top of the base Transformer layers, suitable for tasks like sentiment analysis.
  • LoraConfig(...): This is the heart of our PEFT implementation.
    • r (rank): Determines the rank of the update matrices. A smaller r means fewer trainable parameters and faster training, but potentially less capacity to learn complex adaptations. r=8 or r=16 are common starting points.
    • lora_alpha: A scaling factor that controls the magnitude of the LoRA updates.
    • target_modules: Crucially, this specifies which internal modules of the Transformer should have LoRA layers injected. The query and value projections are the usual targets because they are central to the attention mechanism; their names vary by architecture (q_lin and v_lin for DistilBERT, as used here; query and value for BERT; q_proj and v_proj for Llama-style models).
    • task_type=TaskType.SEQ_CLS: Informs LoRA about the downstream task, which can influence how it configures the adaptation.
  • model = get_peft_model(model, lora_config): This function from the peft library wraps your base model, injecting the LoRA layers without modifying the original weights. It then returns a new PeftModel instance.
  • model.print_trainable_parameters(): This utility function clearly shows the significant reduction in trainable parameters, highlighting the efficiency of LoRA. You'll typically see trainable parameters account for less than 1% of the total parameters.
  • TrainingArguments: Configures all aspects of the training process, including learning rate, batch size, number of epochs, logging, and evaluation strategy. fp16=True is a critical optimization for CUDA-enabled GPUs, enabling mixed-precision training for faster computations and reduced memory usage.
  • Trainer: The Trainer API from Hugging Face is a high-level abstraction that streamlines the entire training and evaluation loop, handling boilerplate code like batching, gradient accumulation, and logging.
  • model.save_pretrained(...): When using PEFT, you typically only save the LoRA adapter weights, not the entire model. This results in much smaller checkpoint files (often just a few megabytes) that can be easily shared and loaded.
  • PeftModel.from_pretrained(base_model, lora_path): For inference, you first load the original (unmodified) base model, then use PeftModel.from_pretrained to attach the saved LoRA adapters on top of it. The adapters can optionally be folded into the base weights with merge_and_unload() for adapter-free inference, as shown in the short snippet after this list.
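
If adapter indirection at inference time is undesirable, peft can fold the LoRA weights back into the base model. A short sketch, reusing loaded_model and loaded_tokenizer from the example above (the output directory name is illustrative):

# Fold the LoRA weights into the base model for plain, adapter-free inference
merged_model = loaded_model.merge_and_unload()
merged_model.save_pretrained("./merged_financial_sentiment_model_2026")      # illustrative path
loaded_tokenizer.save_pretrained("./merged_financial_sentiment_model_2026")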

💡 Expert Tips: From the Trenches

Building and deploying custom NLP models at scale demands more than just foundational knowledge. Here are insights gleaned from practical deployments:

  1. Data Curation is Your Force Multiplier: The single biggest determinant of your custom model's performance is the quality and representativeness of your training data. Invest heavily in meticulous data labeling, robust data augmentation (e.g., synonym replacement, back-translation, conditional text generation with small LLMs), and stringent data cleaning. For financial data, this means expert annotators, clear guidelines for ambiguity, and consistent terminology. A small, high-quality, domain-specific dataset will almost always outperform a large, noisy, generic one.
  2. Strategic Base Model Selection: Do not default to the largest model. In 2026, the landscape of compact yet powerful LLMs and domain-specific models (e.g., "tiny" models optimized for specific languages/tasks, or models like Llama 3.1 8B variants) is rich. Prioritize models pre-trained on corpora semantically close to your target domain. Consider factors like model size (for deployment constraints), available compute, and reported performance on similar tasks. For highly sensitive data, assess the model's pre-training data sources for potential bias or undesirable content.
  3. Advanced PEFT Beyond LoRA: While LoRA is a solid starting point, explore other PEFT techniques. QLoRA is essential for fine-tuning multi-billion parameter models on limited GPU memory. Adapter tuning (e.g., using adapters from AdapterHub) can offer better modularity and task-specific parameter efficiency for multi-task learning scenarios. Benchmark different PEFT strategies for your specific task to find the optimal trade-off between performance and resource consumption.
  4. Hardware & Software Acceleration:
    • Mixed-Precision Training (FP16/BF16): Always enable this (fp16=True in TrainingArguments) on NVIDIA GPUs (A100, H100, B200) for significant speedups and reduced memory footprint without noticeable performance degradation.
    • Gradient Accumulation: If your batch size is limited by GPU memory, use gradient accumulation to simulate larger effective batch sizes (gradient_accumulation_steps in TrainingArguments); see the sketch after this list.
    • Distributed Training: For large datasets and models, leverage frameworks like Hugging Face accelerate or PyTorch's FSDP (Fully Sharded Data Parallel) to distribute training across multiple GPUs or machines efficiently.
  5. Hyperparameter Optimization is NOT Optional: Fine-tuning hyperparameters (learning rate, LoRA rank/alpha, weight decay, optimizer choice) is critical. Don't rely on defaults. Employ automated tools like Optuna, Ray Tune, or Weights & Biases (W&B) Sweeps to systematically explore the hyperparameter space. Even small adjustments can yield significant performance gains and faster convergence. Start with a learning rate sweep, as it's often the most impactful.
  6. Continuous Monitoring & MLOps: Establish robust MLOps practices. Monitor model performance degradation over time (concept drift), log all training runs, experiment configurations, and results. Implement version control for datasets, code, and models. Automated retraining pipelines triggered by data drift or performance metrics are vital for maintaining model relevance in dynamic environments.
  7. Ethical AI & Bias Mitigation: Custom NLP models can inherit and amplify biases present in their pre-training data or inadvertently introduced during fine-tuning. Proactively evaluate your model for fairness and bias across different demographic groups, sensitive attributes, or specific entity types. Techniques include bias detection toolkits (e.g., AIF360, Fairness Indicators) and using debiased training datasets or specific regularization methods. Explainability tools (e.g., LIME, SHAP) can provide insights into model decisions, crucial for transparency in regulated industries.
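
As a rough illustration of the mixed-precision and gradient-accumulation tips above, the sketch below reaches an effective batch size of 64 on a memory-constrained GPU by combining a per-device batch size of 8 with 8 accumulation steps, plus bf16 mixed precision. The specific numbers and output directory are assumptions for the example.

from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="./results_accumulated",    # illustrative output directory
    per_device_train_batch_size=8,         # what fits in GPU memory
    gradient_accumulation_steps=8,         # 8 x 8 = effective batch size of 64
    bf16=True,                             # or fp16=True on GPUs without bfloat16 support
    learning_rate=2e-4,
    num_train_epochs=3,
)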

Comparative Landscape: Customization Approaches in 2026

When deciding on a strategy for custom NLP model development, a nuanced understanding of each approach's strengths and considerations is paramount.

🦅 Fine-tuning a General-Purpose Foundation Model (e.g., Llama 3.1, Mistral Large)

✅ Strengths
  • 🚀 Accessibility & Cost-Effectiveness: Leverages widely available, pre-trained models with broad knowledge, requiring significantly less compute and data than training from scratch. Ideal for teams with limited resources.
  • ✨ Strong Generalization: Benefits from massive pre-training corpora, providing a robust understanding of language structure and broad semantic context, making it adaptable to a wide range of tasks.
  • 🚀 Rapid Prototyping: Quick to set up and fine-tune, enabling fast iteration and deployment of initial solutions for various NLP problems.
⚠️ Considerations
  • 💰 Domain Misalignment: May struggle with highly specialized jargon, subtle nuances, or specific factual knowledge of niche domains (e.g., highly technical medical reports or arcane legal texts) without substantial fine-tuning data.
  • 💰 Pre-training Bias Inheritance: Can inherit and propagate biases present in the general-purpose pre-training data, requiring diligent bias detection and mitigation efforts.
  • 💰 Suboptimal Peak Performance: While good, it might not reach the absolute peak performance achievable by models specifically designed or continuously pre-trained for an extremely narrow, critical domain.

🔬 Fine-tuning a Domain-Specific Pre-trained Model (e.g., BioBERT-XL, FinBERT 2.0)

✅ Strengths
  • 🚀 Superior Domain Accuracy: Achieves significantly higher accuracy and F1 scores in specific domains due to prior exposure to domain-specific vocabulary, syntax, and conceptual relationships during its specialized pre-training.
  • ✨ Reduced Data Requirements: Requires less task-specific labeled data for fine-tuning because it already possesses a foundational understanding of the domain, accelerating development.
  • 🚀 Faster Convergence: Often converges faster during fine-tuning and is more robust to smaller training sets, leading to quicker model iterations.
⚠️ Considerations
  • 💰 Limited Availability: Publicly available domain-specific models may not exist for every niche, or their quality/recency might vary. Proprietary domain models often come with licensing costs.
  • 💰 Narrow Applicability: While excellent in its domain, its performance can degrade significantly if applied to tasks outside its pre-training scope.
  • 💰 Update Lag: Domain-specific models might not be updated as frequently as general-purpose models, potentially lagging on the absolute latest architectural improvements or general language understanding.

⚙️ Training from Scratch (Custom Architecture/Pre-training)

✅ Strengths
  • 🚀 Ultimate Customization & Control: Full control over architecture, pre-training objectives, and data, allowing for unparalleled tailoring to unique, highly specialized tasks or languages with no existing resources.
  • ✨ Proprietary Advantage: Can yield unique models that provide a significant competitive advantage, especially when trained on proprietary data unavailable elsewhere.
  • 🚀 Bleeding-Edge Research: Essential for pushing the boundaries of NLP, developing novel architectures, or understanding language in entirely new contexts.
⚠️ Considerations
  • 💰 Astronomical Costs: Demands immense computational resources (hundreds of top-tier GPUs for weeks or months), massive unlabeled datasets (billions of tokens), and expert-level ML engineering teams. Cost-prohibitive for most organizations.
  • 💰 Time & Expertise Intensive: Extremely long development cycles requiring deep expertise in distributed training, model parallelism, and hardware optimization.
  • 💰 Risk & Uncertainty: High risk of failure or suboptimal performance due to the complexity and sheer number of variables involved, without the benefit of transfer learning from established models.

Frequently Asked Questions (FAQ)

Q1: When should I train a Transformer model from scratch versus fine-tuning?

A1: Training from scratch is warranted only when no suitable pre-trained models exist for your specific language, domain, or architecture, and you have access to vast computational resources and expertise. For nearly all practical applications in 2026, fine-tuning a relevant pre-trained foundation model (general or domain-specific) is the most efficient and effective strategy.

Q2: What are the key considerations for selecting a base Transformer model in 2026?

A2: Consider the model's size (smaller models like DistilBERT or specialized "tiny" LLMs for edge deployment; larger models like Llama 3 for maximum performance), the relevance of its pre-training corpus to your domain, its architectural features (e.g., efficiency in attention mechanisms, activation functions), and its licensing terms. Benchmarking several options on a small subset of your data is highly recommended.

Q3: How do I manage the computational costs of training large custom NLP models?

A3: Leverage Parameter-Efficient Fine-Tuning (PEFT) techniques like LoRA or QLoRA, utilize mixed-precision training (FP16/BF16), employ gradient accumulation, and explore distributed training frameworks (e.g., accelerate, FSDP) when necessary. Optimizing your data loading pipelines and utilizing efficient cloud instances (e.g., those with A100/H100 GPUs) are also crucial.

Q4: Is data privacy a significant concern when using pre-trained models from third parties?

A4: Yes. While fine-tuning typically doesn't expose your proprietary data to the original model creators, the pre-trained models themselves might have been trained on data that contains biases, sensitive information, or has licensing restrictions. Always review the origin and training methodologies of the base models, and for highly sensitive applications, consider federated learning or differential privacy techniques.


Conclusion and Next Steps

The ability to build and deploy custom NLP models powered by Transformer architectures and intelligent fine-tuning techniques like PEFT is no longer a niche skill but a fundamental requirement for competitive advantage in 2026. By understanding the underlying mechanics of attention, strategically selecting your pre-training approach, and meticulously preparing your domain-specific data, you can unlock unparalleled accuracy and insight from your unstructured information.

The practical implementation demonstrated here, leveraging the Hugging Face ecosystem and LoRA, provides a robust blueprint for your own custom NLP projects. I encourage you to experiment with different base models, explore varied LoraConfig parameters, and apply these methodologies to your unique datasets. The power is now in your hands to transform raw text into actionable intelligence. Share your experiences, challenges, and breakthroughs in the comments below – the collective knowledge of our community drives innovation forward.


Author

Carlos Carvajal Fiamengo

Senior Full Stack Developer (10+ years) specialized in end-to-end solutions: RESTful APIs, scalable backends, user-centered frontends, and DevOps practices for reliable deployments.

10+ years of experience · Valencia, Spain · Full Stack | DevOps | ITIL

