책 이미지
책 정보
· 분류 : 외국도서 > 과학/수학/생태 > 수학 > 확률과 통계 > 일반
· ISBN : 9781032090856
· 쪽수 : 314쪽
목차
1. IntroductionA Simple ExampleImportant ConceptsA More Complex ExampleFeature SelectionAn Outline of the BookComputing2. Illustrative Example: Predicting Risk of Ischemic StrokeSplittingPreprocessingExplorationPredictive Modeling Across SetsOther ConsiderationsComputing3. A Review of the Predictive Modeling ProcessIllustrative Example: OkCupid Profile DataMeasuring PerformanceData SplittingResamplingTuning Parameters and OverfittingModel Optimization and TuningComparing Models Using the Training SetFeature Engineering Without OverfittingSummaryComputing4. Exploratory VisualizationsIntroduction to the Chicago Train Ridership DataVisualizations for Numeric Data: Exploring Train Ridership DataVisualizations for Categorical Data: Exploring the OkCupid DataPost Modeling Exploratory VisualizationsSummaryComputing5. Encoding Categorical PredictorsCreating Dummy Variables for Unordered CategoriesEncoding Predictors with Many CategoriesApproaches for Novel CategoriesSupervised Encoding MethodsEncodings for Ordered DataCreating Features from Text DataFactors versus Dummy Variables in Tree-Based ModelsSummaryComputing6. Engineering Numeric PredictorsTransformationsMany TransformationsMany: Many TransformationsSummaryComputing7. Detecting Interaction EffectsGuiding Principles in the Search for InteractionsPractical ConsiderationsThe Brute-Force Approach to Identifying Predictive InteractionsApproaches when Complete Enumeration is Practically ImpossibleOther Potentially Useful ToolsSummaryComputing8. Handling Missing DataUnderstanding the Nature and Severity of Missing InformationModels that are Resistant to Missing ValuesDeletion of DataEncoding MissingnessImputation methodsSpecial CasesSummaryComputing9. Working with Profile DataIllustrative Data: Pharmaceutical Manufacturing MonitoringWhat are the Experimental Unit and the Unit of Prediction?Reducing BackgroundReducing Other NoiseExploiting CorrelationImpacts of Data Processing on ModelingSummaryComputing10. Feature Selection OverviewGoals of Feature SelectionClasses of Feature Selection MethodologiesEffect of Irrelevant FeaturesOverfitting to Predictors and External ValidationA Case StudyNext StepsComputing11. Greedy Search MethodsIllustrative Data: Predicting Parkinson’s DiseaseSimple FiltersRecursive Feature EliminationStepwise SelectionSummaryComputing12. Global Search MethodsNaive Bayes ModelsSimulated AnnealingGenetic AlgorithmsTest Set ResultsSummaryComputing