Optimization Problems in Machine Learning

Similar documents
A Semi-Supervised Recommender System to Predict Online Job Offer Performance

Prediction of High-Cost Hospital Patients Jonathan M. Mortensen, Linda Szabo, Luke Yancy Jr.

Predicting Medicare Costs Using Non-Traditional Metrics

Statistical Analysis Tools for Particle Physics

Accreditation Standards 2014 Diagnostic Imaging

Enhancing Sustainability: Building Modeling Through Text Analytics. Jessica N. Terman, George Mason University

Settling for Academia? H-1B Visas and the Career Choices of International Students in the United States

Technical Notes on the Standardized Hospitalization Ratio (SHR) For the Dialysis Facility Reports

Applying client churn prediction modelling on home-based care services industry

Tree Based Modeling Techniques Applied to Hospital Length of Stay

Department of Defense DIRECTIVE

Chronic Risk and Disease Management Model Using Structured Query Language and Predictive Analysis

Faculty of Computer Science

Differences in employment histories between employed and unemployed job seekers

The attitude of nurses towards inpatient aggression in psychiatric care Jansen, Gradus

Guidelines for Mammography Additional Qualification

Palomar College ADN Model Prerequisite Validation Study. Summary. Prepared by the Office of Institutional Research & Planning August 2005

Dynamic PRA of a Multi-unit Plant

AN APPOINTMENT ORDER OUTPATIENT SCHEDULING SYSTEM THAT IMPROVES OUTPATIENT EXPERIENCE

QUEUING THEORY APPLIED IN HEALTHCARE

Guide to Using the Common Intake and Assessment Tool

Technical Notes for HCAHPS Star Ratings (Revised for April 2018 Public Reporting)

Satisfaction and Experience with Health Care Services: A Survey of Albertans December 2010

CURRICULUM VITAE. Assistant Professor, Department of Mathematics, College of Arts and Sciences, University of Dayton.

DARPA-BAA EXTREME Frequently Asked Questions (FAQs) as of 10/7/16

Technical Notes for HCAHPS Star Ratings (Revised for October 2017 Public Reporting)

Statistical Analysis for the Military Decision Maker (Part II) Professor Ron Fricker Naval Postgraduate School Monterey, California

Supplementary Material Economies of Scale and Scope in Hospitals

2013, Vol. 2, Release 1 (October 21, 2013), /10/$3.00

New gtld Program. Community Priority Evaluation Result. Report Date: 10 February 2016

Introduction to Handwritten Signature Verification

A Reality Check on Health Information Privacy: How should we understand re-identification risks under HIPAA?

Targeted technology and data management solutions for observational studies

FCSM Research and Policy Conference March 8, 2018 Joshua Goldstein

Incident Reporting Systems

NEWS RELEASE. New funding to improve access to surgeries and MRI scans in British Columbia

Chapter 2 Nursing Process

M. Sc. Programme in Big Data Analytics

Employed and Unemployed Job Seekers and the Business Cycle*

University of Michigan Health System Analysis of Wait Times Through the Patient Preoperative Process. Final Report

Protocol for Assigning Hospitals to Groups under The Public Hospitals Act Stakeholders Copy

smart technologies Neonatal incubator from standard to intensive care

Identifying step-down bed needs to improve ICU capacity and costs

How to Help Write a Good Consent Form: MOVING FROM! INFORMED CONSENT to INFORMED CHOICE

Historical Imagery Digitization Data Project

Operator Assignment and Routing Problems in Home Health Care Services

Fuzzy Set SEG

INPATIENT SURVEY PSYCHOMETRICS

Preservative License Plate De-identification for Privacy Protection

(Consolidated up to 113/2009) ALBERTA REGULATION 61/2005. Health Professions Act

Recommendations to Health Quality Ontario

Staffing and Scheduling

Planning Calendar Grade 5 Advanced Mathematics. Monday Tuesday Wednesday Thursday Friday 08/20 T1 Begins

Nowcasting and Placecasting Growth Entrepreneurship. Jorge Guzman, MIT Scott Stern, MIT and NBER

27A: For the purposes of the BAA, a non-u.s. individual is an individual who is not a citizen of the U.S. See Section III.A.2 of the BAA.

HOWARD UNIVERSITY Position Description. POSITION TITLE: Radiation Safety Officer SALARY GRADE: HU-13. DATE REVISED: December 01, 2014 EEO CODE: 02

Standard EC Elements of Performance for EC The hospital manages fire risks.

HOW TO USE THE WARMBATHS NURSING OPTIMIZATION MODEL

Continuously Measuring Patient Outcome using Variable Life-Adjusted Displays (VLAD)

Does access to information technology make people happier? Insights from well-being surveys from around the world*

Automatically Recommending Healthy Living Programs to Patients with Chronic Diseases through Hybrid Content-Based and Collaborative Filtering

TITLE: Low Band Telemedicine Decision Support System for Disaster Situations

Turning Big Data Into Better Care

Vulnerable Patients and the Patient Experience. Dennis O. Kaldenberg, Ph.D. Chief Scientist

Pricing and funding for safety and quality: the Australian approach

COMPdata ICD-10 Transition Guide

Radiologic technologists take x rays and administer nonradioactive materials into patients bloodstreams for diagnostic purposes.

Profit Efficiency and Ownership of German Hospitals

DEVELOPMENT AND PERFORMANCE OF TEXT-MINING ALGORITHMS TO EXTRACT SOCIOECONOMIC STATUS FROM DE-IDENTIFIED ELECTRONIC HEALTH RECORDS

Craigslist s Effect on Violence Against Women

Scottish Hospital Standardised Mortality Ratio (HSMR)

Improving the public health sector in South Africa: eliciting public preferences using a discrete choice experiment

University of Arkansas for Medical Sciences. Part I - Safety Management Plan FY18

Putting Nanotechnology on the Map

Exploring the Structure of Private Foundations

NHS Digital is the new trading name for the Health and Social Care Information Centre (HSCIC).

CAPACITY PLANNING AND MANAGEMENT IN HOSPITALS

The Economic Incidence of Federal Student Grant Aid

Data-Driven Patient Scheduling in Emergency Departments: A Hybrid Robust Stochastic Approach

Kaushallya Adhikari. B.E. in Electronics and Communication Engineering

2 Quality Assurance In A Diagnostic Radiology Department. 1.1 Aim. 1.2 Introduction. 1.3 Key Elements of Quality assurance

Household survey on access and use of medicines

UCSF MEDICAL CENTER JOB DESCRIPTION MANAGER S SIGNATURE:

The Relationship between Structural and Psychological Empowerment and Participation in Continuing Professional Development in Oncology Nurses

Supplemental materials for:

Anesthesia Knowledge Test Series Examinations Instructions. Revised as of: 07/31/2014 Metrics Associates, Inc John A. Jensen

40 High-Paying Jobs That Don't Require A Bachelor's Degree

Maximizing the Power of Your Data. Peggy Connorton, MS, LNFA AHCA Director, Quality and LTC Trend Tracker

SCIENCE COMMITTEE PROGRAMME FOUNDATION AWARDS OUTLINE APPLICATION GUIDELINES

Proximity and Software Programming: IT Outsourcing and the Local Market

Fertility Response to the Tax Treatment of Children

Inferring Hospital Quality from Patient Discharge Records Using a Bayesian Selection Model

Tools for risk assessment in radiation therapy

Document de treball de l IEB 2011/12

Radiation Safety Code of Practice

University of Manitoba Graduate Courses in Community Health Sciences

Health service availability and health seeking behaviour in resource poor settings: evidence from Mozambique

SMALL GROUP SESSION 6A September 22 nd or September 24 th

Chemotherapy appointment scheduling under uncertainty using mean-risk stochastic integer programming

Criteria for Adjudication of Echocardiography Facilities May 2018

Transcription:

Optimization Problems in Machine Learning Katya Scheinberg Lehigh University 2/15/12 EWO Seminar 1

Binary classification problem Two sets of labeled points - + 2/15/12 EWO Seminar 2

Binary classification problem How to label this new point? - + 2/15/12 EWO Seminar 3

Binary classification problem Probably green - + 2/15/12 EWO Seminar 4

Binary classification problem - + What about this one? 2/15/12 EWO Seminar 5

Binary classification problem - + Or this one? 2/15/12 EWO Seminar 6

Examples from image classification l Optical character recognition l Automatically read digits in zip code l 256 dim vector of pixels, 10 classes, l classification or clustering task l Face recognition and detection l much larger dimension, nonlinear representation, l Non-euclidean similarity measures 2/15/12 EWO Seminar 7

Examples from text and internet l Text categorization l detect spam/nonspam emails l Many possible features l l False positives are very bad, false negatives are OK. l Online setting possible, huge data sets. choose articles of interest to individualize news sites l Large dimension size of dictionary, small training set, possibly online setting l Only few words are important. l Ranking l Predict a page rank for a given a search query l How to do it? Predict relative ranks of each pair of pages? 2/15/12 EWO Seminar 8

l l Examples from Medicine Functional Magnetic resonance imaging l Uses a standard MRI scanner to acquire images of functionally meaningful brain activity l l l l Measures changes in blood oxygenation Non-invasive, no ionizing radiation Good combination of spatial / temporal resolution l Voxel sizes ~4mm l Time of Repetition (TR) ~1s About 30000 voxels are active and measured. Only a few (probably) contribute to what the subject is feeling during the experiment (anger, frustration, boredom..) Breast cancer risk patients l l l l Take several measurements of a patient and some basic characteristics an predict if the patient is at high risk Low dimensional, but very different attributes. Large scale data. May involve active learning additional labels obtained by involving more tests or a professional. KDD 2008 cup challenge 2/15/12 EWO Seminar 9 fmri image courtesy of fmri Research Center @ Columbia Unoversity

The binary classification problem 2/15/12 EWO Seminar 10

Example 1 SUPPORT VECTOR MACHINES 2/15/12 EWO Seminar 11

Linear classifier Idea: separate a space into two half-spaces - + 2/15/12 EWO Seminar 12

Linear classifier Like this: - + 2/15/12 EWO Seminar 13

Linear classifier (0,1) - + w (1, 0) 2/15/12 EWO Seminar 14

Linear classifier (0,1) - + w (1, 0) 2/15/12 EWO Seminar 15

Linear classifier - + 2/15/12 EWO Seminar 16

Support vector machines - + Find the largest r or the smallest w 2/15/12 EWO Seminar 17

Support vector machines - + 2/15/12 EWO Seminar 18

Optimization Problem How many variables? Constraints? What can go wrong? 2/15/12 EWO Seminar 19

Support vector machines - + 2/15/12 EWO Seminar 20

Soft margin SVM How many variables? Constraints? 2/15/12 EWO Seminar 21

Soft margin SVM No constraints, but nonsmooth objective What if n is very large? What if m is very large? 2/15/12 EWO Seminar 22

Oh, no! What do we do now? + - + 2/15/12 EWO Seminar 23

Kernel SVM + - + 2/15/12 EWO Seminar 24

Kernel SVM + - + 2/15/12 EWO Seminar 25

Example 2 COLLABORATIVE FILTERING, NETFLIX CHALLENGE 2/15/12 EWO Seminar 26

l Some users rate some movies they watched (or didn t!) l Predict the rating (1..5) for each user/ movie pair. l Use this prediction to recommend users the movies that they would like 2/15/12 EWO Seminar 27

Matrix completion problem, collaborative filtering Collaborative filtering: famous Netflix challenge Will user i like movie j? Complete the matrix based on partially filled information. 2/15/12 EWO Seminar 28

Linear factor model 2/15/12 EWO Seminar 29

Convex relaxation via nuclear norm l Given the values for a subset of entries, find the matrix with these entries and the smallest (or given) rank. l NP-hard problem. 2/15/12 EWO Seminar 30

Convex relaxation via nuclear norm l Given the values for a subset of entries, find the matrix with these entries and the smallest nuclear norm. l Convex problem 2/15/12 EWO Seminar 31

Convex relaxation via nuclear norm l Given the values for a subset of entries, find the matrix with similar entries and the smallest nuclear norm. l Or 2/15/12 EWO Seminar 32

SPARSE REGRESSION, LASSO 2/15/12 EWO Seminar 33

Least Squares Linear Regression 2/15/12 EWO Seminar 34

Disease state prediction 2/15/12 EWO Seminar 35

Least squares problem Standard form of LS problem A has 500000 columns and 5000 rows underdetermined. Regularized regression can be used x is going to be dense hence linear combination of all factors (genes) We would prefer to find a linear combinations of as few genes as possible 2/15/12 EWO Seminar 36

Lasso and other formulations to recover structure Sparse regularized regression or Lasso: Sparse regressor selection Noisy signal recovery 2/15/12 EWO Seminar 37

SPARSE INVERSE COVARIANCE SELECTION 2/15/12 EWO Seminar 38

Sparse inverse covariance selection 2/15/12 EWO Seminar 39

Optimizing log likelihood 2/15/12 EWO Seminar 40

Enforcing sparsity l Convex relaxation l Convex optimization problem with unique solution for each ½ 2/15/12 EWO Seminar 41

SOLUTION APPROACHES 2/15/12 EWO Seminar 42

Examples Lasso SVM Collaborative filtering Robust PCA SICS

Alternating directions (splitting) method Consider: Relax constraints via Augmented Lagrangian technique In our examples f(x) and g(y) are both such that the above functions are easy to optimize in x or y 2/15/12 EWO Seminar 44

A variant of alternating directions method This turns out to be equivalent to 2/15/12 EWO Seminar Goldfarb, Ma and S, 10 45

Alternating linearization method (ALM) 2/15/12 EWO Seminar 46 Goldfarb, Ma, S, 10

What is involved? l Theoretical convergence guarantees and convergence rates have been developed l The real complexity depends on the choice of µ l Various strategies for parameter selection affect performance and have extra costs. l Depending on application minimization and gradient computations can be expensive. l Inexact computations may be utilized but may lead to worse convergence properties. l Parallelization? Stochastic sampling? 2/15/12 EWO Seminar 47

THANK YOU! 2/15/12 EWO Seminar 48