--- name: advanced-analytics description: Advanced analytics including machine learning, predictive modeling, and big data techniques version: "2.0.0" sasmp_version: "2.0.0" bonded_agent: 06-advanced-analytics-specialist bond_type: PRIMARY_BOND # Skill Configuration config: atomic: true retry_enabled: true max_retries: 3 backoff_strategy: exponential model_training_timeout: 3600 # Parameter Validation parameters: skill_level: type: string required: true enum: [intermediate, advanced, expert] default: intermediate focus_area: type: string required: false enum: [regression, classification, clustering, timeseries, feature_engineering, all] default: all deployment_target: type: string required: false enum: [notebook, api, batch, realtime] default: notebook # Observability observability: logging_level: info metrics: [model_accuracy, training_time, prediction_latency, feature_importance] model_versioning: true --- # Advanced Analytics Skill ## Overview Master advanced analytics techniques including machine learning, predictive modeling, and big data processing for sophisticated data analysis. ## Core Topics ### Machine Learning Fundamentals - Supervised vs unsupervised learning - Classification algorithms (logistic regression, decision trees, random forest) - Regression algorithms (linear, polynomial, ensemble methods) - Clustering (K-means, hierarchical, DBSCAN) ### Predictive Analytics - Time series forecasting (ARIMA, exponential smoothing) - Customer segmentation and RFM analysis - Churn prediction models - A/B testing and experimentation ### Big Data Technologies - Introduction to Spark and PySpark - Data lakes and data mesh concepts - Cloud analytics platforms (AWS, GCP, Azure) - Real-time analytics with streaming data ### Advanced Techniques - Feature engineering best practices - Model validation and cross-validation - Hyperparameter tuning - Model deployment considerations ## Learning Objectives - Build and validate machine learning models - Implement predictive analytics solutions - Work with big data technologies - Apply advanced statistical techniques ## Error Handling | Error Type | Cause | Recovery | |------------|-------|----------| | Overfitting | Model too complex | Add regularization, reduce features | | Underfitting | Model too simple | Add features, increase complexity | | Data leakage | Target info in features | Review feature engineering pipeline | | Class imbalance | Skewed target | Use SMOTE, class weights, or resampling | | Convergence failure | Poor hyperparameters | Grid search, adjust learning rate | ## Related Skills - statistics (for foundational statistical knowledge) - programming (for ML implementation) - databases-sql (for big data querying)