Learning Connectivity Networks from High-Dimensional Point Processes

Similar documents
Modeling Wine Quality Using Classification and Regression. Mario Wijaya MGT 8803 November 28, 2017

Online Appendix to. Are Two heads Better Than One: Team versus Individual Play in Signaling Games. David C. Cooper and John H.

Multiple Imputation for Missing Data in KLoSA

Missing value imputation in SAS: an intro to Proc MI and MIANALYZE

STABILITY IN THE SOCIAL PERCOLATION MODELS FOR TWO TO FOUR DIMENSIONS

Predicting Wine Quality

Imputation of multivariate continuous data with non-ignorable missingness

Zeitschrift für Soziologie, Jg., Heft 5, 2015, Online- Anhang

Sponsored by: Center For Clinical Investigation and Cleveland CTSC

The multivariate piecewise linear growth model for ZHeight and zbmi can be expressed as:

CS 322: (Social and Information) Network Analysis Jure Leskovec Stanford University

MBA 503 Final Project Guidelines and Rubric

Wine Rating Prediction

Appendix A. Table A.1: Logit Estimates for Elasticities

SHEA ROASTING IN A NUTSHELL. Brian Gylland (& Paul Means) Burn Design Lab ETHOS January 26-28, 2018

DETERMINANTS OF DINER RESPONSE TO ORIENTAL CUISINE IN SPECIALITY RESTAURANTS AND SELECTED CLASSIFIED HOTELS IN NAIROBI COUNTY, KENYA

Computerized Models for Shelf Life Prediction of Post-Harvest Coffee Sterilized Milk Drink

The Market Potential for Exporting Bottled Wine to Mainland China (PRC)

An application of cumulative prospect theory to travel time variability

Decision making with incomplete information Some new developments. Rudolf Vetschera University of Vienna. Tamkang University May 15, 2017

Hybrid ARIMA-ANN Modelling for Forecasting the Price of Robusta Coffee in India

Gail E. Potter, Timo Smieszek, and Kerstin Sailer. April 24, 2015

Napa Highway 29 Open Wineries

Flexible Working Arrangements, Collaboration, ICT and Innovation

Problem Set #3 Key. Forecasting

Wine-Tasting by Numbers: Using Binary Logistic Regression to Reveal the Preferences of Experts

This appendix tabulates results summarized in Section IV of our paper, and also reports the results of additional tests.

Illinois Geometry Lab. Percolation Theory. Authors: Michelle Delcourt Kaiyue Hou Yang Song Zi Wang. Faculty Mentor: Kay Kirkpatrick

Internet Appendix to. The Price of Street Friends: Social Networks, Informed Trading, and Shareholder Costs. Jie Cai Ralph A.

Emerging Local Food Systems in the Caribbean and Southern USA July 6, 2014

Internet Appendix for Does Stock Liquidity Enhance or Impede Firm Innovation? *

Lesson 23: Newton s Law of Cooling

Mobility tools and use: Accessibility s role in Switzerland

Supporing Information. Modelling the Atomic Arrangement of Amorphous 2D Silica: Analysis

Restaurant Management

DIR2017. Training Neural Rankers with Weak Supervision. Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Sascha Rothe, Jaap Kamps, and W.

A Note on H-Cordial Graphs

Emerging Research in Aromas & Sweetness Enhancement

Mapping and Detection of Downy Mildew and Botrytis bunch rot Resistance Loci in Norton-based Population

7 th Annual Conference AAWE, Stellenbosch, Jun 2013

IDOT and Illinois Tollway Connecting Northern Illinois Over the Years. Clarita Lao, Illinois Tollway September 29, 2017

Incremental Record Linkage. Anja Gruenheid!! Xin Luna Dong!!! Divesh Srivastava

The Sources of Risk Spillovers among REITs: Asset Similarities and Regional Proximity

wine 1 wine 2 wine 3 person person person person person

Academic Year 2014/2015 Assessment Report. Bachelor of Science in Viticulture, Department of Viticulture and Enology

AJAE Appendix: Testing Household-Specific Explanations for the Inverse Productivity Relationship

RELATIVE EFFICIENCY OF ESTIMATES BASED ON PERCENTAGES OF MISSINGNESS USING THREE IMPUTATION NUMBERS IN MULTIPLE IMPUTATION ANALYSIS ABSTRACT

Buying Filberts On a Sample Basis

Activity 10. Coffee Break. Introduction. Equipment Required. Collecting the Data

Promote and support advanced computing to further Tier-One research and education at the University of Houston

Relation between Grape Wine Quality and Related Physicochemical Indexes

Cloud Computing CS

2016 AGU Fall Meeting Scientific Program Public Affairs

Relationships Among Wine Prices, Ratings, Advertising, and Production: Examining a Giffen Good

STACKING CUPS STEM CATEGORY TOPIC OVERVIEW STEM LESSON FOCUS OBJECTIVES MATERIALS. Math. Linear Equations

THE ECONOMIC IMPACT OF BEER TOURISM IN KENT COUNTY, MICHIGAN

The Elasticity of Substitution between Land and Capital: Evidence from Chicago, Berlin, and Pittsburgh

Measuring economic value of whale conservation

AST Live November 2016 Roasting Module. Presenter: John Thompson Coffee Nexus Ltd, Scotland

Analysis of Fruit Consumption in the U.S. with a Quadratic AIDS Model

Biosignal Processing Mari Karsikas

From VOC to IPA: This Beer s For You!

Imputation Procedures for Missing Data in Clinical Research

MyPlate. National FCS Standard: Apply various dietary guidelines in planning to meet nutrition and wellness needs.

STUDY AND IMPROVEMENT FOR SLICE SMOOTHNESS IN SLICING MACHINE OF LOTUS ROOT

Missing Data Methods (Part I): Multiple Imputation. Advanced Multivariate Statistical Methods Workshop

Word Embeddings for NLP in Python. Marco Bonzanini PyCon Italia 2017

The Roles of Social Media and Expert Reviews in the Market for High-End Goods: An Example Using Bordeaux and California Wines

Jure Leskovec, Computer Science Dept., Stanford

Missing Data Treatments

PART I HAWAII HEALTH SYSTEMS CORPORATION STATE OF HAWAII Class Specifications for the Classes:

Online Appendix to The Effect of Liquidity on Governance

Reliable Profiling for Chocolate and Cacao

On-line Appendix for the paper: Sticky Wages. Evidence from Quarterly Microeconomic Data. Appendix A. Weights used to compute aggregate indicators

Soybean Yield Loss Due to Hail Damage*

Planning: Regression Planning

Final Exam Financial Data Analysis (6 Credit points/imp Students) March 2, 2006

A hip & healthy café that serves family destinations!

Problem Set #15 Key. Measuring the Effects of Promotion II

The Bank Lending Channel of Conventional and Unconventional Monetary Policy: A Euro-area bank-level Analysis

Identification of Adulteration or origins of whisky and alcohol with the Electronic Nose

Retail Technology Program - Digital Menu Board Preamble

UNIVERSITY COLLEGE OF ENGINEERING (A) OSMANIA UNIVERSITY, HYDERABAD B.E. III Year - II-SEMESTER (MAIN) PRACTICAL EXAMS.

A CASE STUDY: HOW CONSUMER INSIGHTS DROVE THE SUCCESSFUL LAUNCH OF A NEW RED WINE

Percolation Properties of Triangles With Variable Aspect Ratios

Feeling Hungry. How many cookies were on the plate before anyone started feeling hungry? Feeling Hungry. 1 of 10

L-Theanine Clinical Studies

Near-critical percolation and minimal spanning tree in the plane

CENTRAL AMERICA COFFEE RUST ACTION PLAN 2013 Component 1 Integrated Coffee Rust Management. LEADERS and PARTICIPANTS

IMSI Annual Business Meeting Amherst, Massachusetts October 26, 2008

Fedima Position Paper on Labelling of Allergens

EFFECT OF TOMATO GENETIC VARIATION ON LYE PEELING EFFICACY TOMATO SOLUTIONS JIM AND ADAM DICK SUMMARY

The aim of the thesis is to determine the economic efficiency of production factors utilization in S.C. AGROINDUSTRIALA BUCIUM S.A.

Marketing Strategy and Alliances Analysis of Starbucks Corporation

Detecting Melamine Adulteration in Milk Powder

PROCEDURE million pounds of pecans annually with an average

Internet Appendix. For. Birds of a feather: Value implications of political alignment between top management and directors

Big Data and the Productivity Challenge for Wine Grapes. Nick Dokoozlian Agricultural Outlook Forum February

Results April 2019

-- Final exam logistics -- Please fill out course evaluation forms (THANKS!!!)

Transcription:

Learning Connectivity Networks from High-Dimensional Point Processes Ali Shojaie Department of Biostatistics University of Washington faculty.washington.edu/ashojaie Feb 21st 2018

Motivation: Unlocking the Mysteries of the Brain The human brain is composed of 1011 neurons Question: How do neurons work together to perceive the world, make decisions, and perform other higher-level tasks? We will primarily focus on spike train data Sources: Allen Institute for Brain Science (left), Paul De Konnick lab (right) 1

Neuron Spike Train Data Time (s) 2

Neuron Spike Train Data Time (s) 2

Neuron Spike Train Data Time (s) 2

Neuron Spike Train Data Time (s) Spike Train: times at which a neuron spikes (transmits a signal) 2

Neuron Functional Connectivity Among Neurons Time (s) 3

Neuron Functional Connectivity Among Neurons Time (s) 3

Neuron Functional Connectivity Among Neurons Time (s) 3

Neuron Functional Connectivity Among Neurons Time (s) 3

Neuron Functional Connectivity Among Neurons Time (s) 3

Neuron Functional Connectivity Among Neurons Time (s) 3

Learning Functional Connectivity Networks 4

Learning Functional Connectivity Networks 4

Learning Functional Connectivity Networks 4

Learning Functional Connectivity Networks 4

Learning Functional Connectivity Networks 4

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Neuron Challenges in Estimating Functional Connectivity Time (s) May observe thousands of neurons Limited theoretical justification Short duration of stationary period 5

Hawkes Process Introduced by Hawkes (1971) First applied to spike train data by Brillinger et al. 6

A Linear Hawkes Process intensity process point process spontaneous rate transfer function from k to j time when the kth neuron has the ith spike 7

A Linear Hawkes Process intensity process point process spontaneous rate transfer function from k to j time when the kth neuron has the ith spike Functional connectivity: there s an edge from k to j if 7

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 1 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Intensity of Train 2 A Simple Hawkes Process 8

Penalized Regression for Hawkes Processes Joint work with Shizhe Chen, Eric Shea-Brown, and Daniela Witten The multivariate Hawkes process in high dimensions: Beyond mutual excitation (arxiv:1707.04928); invited revision to Annals of Statistics Nearly assumptionless screening for the mutually-exciting multivariate Hawkes process (2017) Electronic Journal of Statistics 9

Penalized Regression for Hawkes Processes Regress each spike train onto others Neighbourhood selection Estimate incoming edges Joint work with Shizhe Chen, Eric Shea-Brown, and Daniela Witten The multivariate Hawkes process in high dimensions: Beyond mutual excitation (arxiv:1707.04928); invited revision to Annals of Statistics Nearly assumptionless screening for the mutually-exciting multivariate Hawkes process (2017) Electronic Journal of Statistics 9

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Least square loss Regression 10

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Least square loss Regression 10

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Squared error loss Regression 10

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Squared error loss Regression 10

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Squared error loss Regression 10

Parameter Estimation via Penalized Regression Model Finite-dimensional basis expansion Squared error loss Regression Estimation via block coordinate descent 10

Properties of Penalized Estimation Procedures Existing theory relies on the cluster process representation Assumes non-negative transfer functions Only holds for linear Hawkes processes 11

Gap in Existing Theory: Neurons Excite and Inhibit 12

A New Concentration Inequality for Hawkes Process New theoretical framework that allows inhibition Use the thinning process representation For any j, k, consider Here can be any continuous and integrable function covers a wide range of second-order statistics of the Hawkes process, including the cross-covariance We have 13

A New Concentration Inequality for Hawkes Process New theoretical framework that allows inhibition Use the thinning process representation For any j, k, consider Here can be any continuous and integrable function covers a wide range of second-order statistics of the Hawkes process, including the cross-covariance 13

A New Concentration Inequality for Hawkes Process New theoretical framework that allows inhibition Use the thinning process representation For any j, k, consider Here can be any continuous and integrable function covers a wide range of second-order statistics of the Hawkes process, including the cross-covariance We have 13

An Application of the New Concentration Inequality Neighbourhood selection recovers the graph with high probability where and are true and estimated edges Key assumptions, i.e., we can handle Stationarity Other regularity conditions for lasso-type estimators 14

A Computational Shortcut Penalized regression becomes computationally (and statistically) challenging with many neurons Can we reduce the number of potential edges? 15

A Computational Shortcut Let Vj,k be the cross-covariance between the jth & kth neurons Consider the graph defined by marginal screening This correlation graph is often used by neuroscientists It is computationally (and statistically) efficient 16

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Cross-Correlation Graph 17

Properties of Screening Recall Q: How does relate to the functional connectivity network,? 18

Properties of Screening If the process is mutually exciting, 18

Properties of Screening If the process is mutually exciting, 18

Properties of Screening If the process is mutually exciting, These results can be shown using our new theoretical framework Unlike existing approaches, they do not require extra assumption 18

Properties of Screening If the process is mutually exciting, These results can be shown using our new theoretical framework Unlike existing approaches, they do not require extra assumption 18

Properties of Screening What if there are negative edges? 19

Properties of Screening What if there are negative edges? 19

Properties of Screening What if there are negative edges? Even with negative edges, screening detects connected components of the graph 19

Properties of Screening What if there are negative edges? Even with negative edges, screening detects connected components of the graph 19

Properties of Screening What if there are negative edges? Even with negative edges, screening detects connected components of the graph 19

Properties of Screening What if there are negative edges? Even with negative edges, screening detects connected components of the graph Screened Edges Connected Components 19

Neurons in Cat Visual Cortex 20

Addressing Non-Stationarity: Piecewise Stationary VARs Motivation: Analyzing EEG Data 21

Addressing Non-Stationarity: Piecewise Stationary VARs Motivation: Analyzing EEG Data 21

Addressing Non-Stationarity: Piecewise Stationary VARs Motivation: Analyzing EEG Data 21

Addressing Non-Stationarity: Piecewise Stationary VARs Motivation: Analyzing EEG Data Brain connectivities expected to change after seizure Goal: To locate the seizure and estimate before/after networks 21

Addressing Non-Stationarity: Piecewise Stationary VARs Our proposal: A 3-step procedure based on total variation penalty 22

Addressing Non-Stationarity: Piecewise Stationary VARs Our proposal: A 3-step procedure based on total variation penalty 22

Addressing Non-Stationarity: Piecewise Stationary VARs Our proposal: A 3-step procedure based on total variation penalty 22

Addressing Non-Stationarity: Piecewise Stationary VARs Our proposal: A 3-step procedure based on total variation penalty Joint work with Abolfazl Safikhani (Columbia Univ) Joint Structural Break Detection and Parameter Estimation in High-Dimensional Non-Stationary VAR Models (arxiv:1711.07357) 22

Acknowledgment Allen Institute for Brain Sciences Funding NIH: NIGMS & NHLBI NSF: DMS & DMS/NIGMS References Chen, Witten & Shojaie (2017) Nearly assumptionless screening for the mutually-exciting multivariate Hawkes process; Electronic Journal of Statistics, 11(1): 1207-1234. Chen, Shojaie, Shea-Brown & Witten (2018+) The multivariate Hawkes process in high dimensions: Beyond mutual excitation; revision invited to the Annals of Statistics (arxiv:1707.04928). Safikhani & Shojaie (2018+) Joint Structural Break Detection and Parameter Estimation in High-Dimensional Non-Stationary VAR Models (arxiv:1711.07357). 23

Acknowledgment Allen Institute for Brain Sciences Funding NIH: NIGMS & NHLBI NSF: DMS & DMS/NIGMS References Chen, Witten & Shojaie (2017) Nearly assumptionless screening for the mutually-exciting multivariate Hawkes process; Electronic Journal of Statistics, 11(1): 1207-1234. Chen, Shojaie, Shea-Brown & Witten (2018+) The multivariate Hawkes process in high dimensions: Beyond mutual excitation; revision invited to the Annals of Statistics (arxiv:1707.04928). Safikhani & Shojaie (2018+) Joint Structural Break Detection and Parameter Estimation in High-Dimensional Non-Stationary VAR Models (arxiv:1711.07357). Thank You! 23

Key Dates Modules: July 11-28 Registration now open

Appendix I Theory for Hawkes Process with Inhibitions

Recap: One-Dimensional Linear Hawkes Process intensity process point process spontaneous rate transfer function time of the ith spike 1

Hawkes Process is Temporally Dependent by Definition 2

Hawkes Process is Temporally Dependent by Definition Key to understanding the Hawkes process: quantifying the temporal dependence 2

Temporal Dependence of a Hawkes Process 3

Temporal Dependence of a Hawkes Process 3

Existing Theory Assumes Non-Negative Transfer Functions 4

Existing Theory Assumes Non-Negative Transfer Functions 4

Represent Processes by Thinning a Poisson Process s Full Process t 5

Represent Processes by Thinning a Poisson Process Spike s Full Process & Thinned Process t 5

Represent Processes by Thinning a Poisson Process Spike s Full Process & Thinned Process This representation applies to any stationary Hawkes process! t 5

Spike s Bounding the Temporal Dependence Using the Thinning Process Representation t Time u 6

Spike s Bounding the Temporal Dependence Using the Thinning Process Representation t 6

Appendix II Iterative Construction of Thinning Process Representation for Hawkes Process

Recap: One-Dimensional Linear Hawkes Process intensity process point process spontaneous rate transfer function time of the ith spike 1

Thinning Process Representation of the Hawkes Process Spike s n=1 t 1

Thinning Process Representation of the Hawkes Process n=2 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=3 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=4 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=5 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=6 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=7 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=8 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process n=9 Intensity in the previous iteration Spike s Intensity in the current iteration Removed spikes t New spikes 1

Thinning Process Representation of the Hawkes Process Spike s n=9 t 2

Appendix III Cluster Process Representation for the Hawkes Process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process Ancestral process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process Descendants Ancestral process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process Descendants Ancestral process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process Descendants Ancestral process

Cluster Process Representation Proposed by Hawkes and Oakes (1974) Represent a Hawkes process as the summation of processes Consider a one-dimensional Hawkes process Hawkes process Descendants Ancestral process Only holds for linear Hawkes processes with

The End