A Principal Component Analysis for Trees (2008)
Aydin, Burcu, Pataki, Gabor, Wang, Haonan, Bullitt, Elizabeth, Marron, J. S.
The active field of Functional Data Analysis (about understanding the variation in a set of curves) has been recently extended to Object Oriented Data Analysis, which considers populations of more...
MultiResolution Anomaly Detection Method for Long Range Dependent Time Series (2008)
Zhang, Lingsong, Zhu, Zhengyuan, Marron, J. S.
Driven by network intrusion detection, we propose a MultiResolution Anomaly Detection (MRAD) method, which effectively utilizes the multiscale properties of Internet features and network anomalies....
SiZer for Censored Density and Hazard Estimation (2008)
Jiang, Jiancheng, Marron, J. S.
The SiZer method is extended to nonparametric hazard estimation and also to censored density and hazard estimation. The new method allows quick, visual statistical inference about the important issue...
Analysis of nonlinear modes of variation for functional data (2007)
A set of curves or images of similar shape is an increasingly common functional data set collected in the sciences. Principal Component Analysis (PCA) is the most widely used technique to decompose...
Object oriented data analysis: Sets of trees (2007)
Object oriented data analysis is the statistical analysis of populations of complex objects. In the special case of functional data analysis, these data objects are curves, where standard Euclidean...
A scale-based approach to finding effective dimensionality in manifold learning (2007)
The discovering of low-dimensional manifolds in high-dimensional data is one of the main goals in manifold learning. We propose a new approach to identify the effective dimension (intrinsic...
SiZer for time series: A new approach to the analysis of trends (2007)
Rondonotti, Vitaliana, Marron, J. S., Park, Cheolwoo
Smoothing methods and SiZer are a useful statistical tool for discovering statistically significant structure in data. Based on scale space ideas originally developed in the computer vision...
The molecular portraits of breast tumors are conserved acress microarray platforms (2006)
Hu, Zhiyuan, Fan, Cheng, Oh, Daniel S., Marron, J. S., He, Xiaping, Qaqish, Bahjat F., ...
BackgroundValidation of a novel gene expression signature in independent data sets is a critical step in the development of a clinically useful test for cancer patient risk-stratification. However,...
Stephen G. Eick, Todd L. Graves, Alan F. Karr, J. S. Marron, Audris Mockus
A central feature of the evolution of large software systems is that change --- which is necessary to add new functionality, accommodate new hardware and repair faults --- becomes increasingly...
Variable Heavy Tails in Internet Traffic (2003)
J. S. Marron, Gennady Samorodnitsky, F. D. Smith
This paper studies tails of the size distribution of Internet data flows and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme tails for understanding the richness...
Variable Heavy Tailed Durations in Internet Trac (2003)
J. S. Marron, Gennady Samorodnitsky, F. D. Smith
This paper studies tails of the duration distribution of internet data ows, and their heaviness".
Extremal Dependence: Internet Traffic Applications (2002)
Campos, F. H., Marron, J. S., Resnick, S. I., Jeffay, K.
Extremal Dependence: Internet Traffic Applications
Extremal Dependence: Internet Traffic Applications (2002)
Campos, F. H., Marron, J. S., Resnick, S. I., Jeffay, K.
Extremal Dependence: Internet Traffic Applications
Variable Heavy Tailed Durations in Internet Traffic, Part II: Theoretical Implications (2002)
This paper is part of a larger paper that studies tails of the duration distribution of Internet data flows, and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme...
Variable Heavy Tailed Durations in Internet Traffic (2002)
J. S. Marron, Gennady Samorodnitsky, F. D. Smith
This paper studies tails of the duration distribution of internet data flows, and their "heaviness"...
Variable Heavy Tailed Durations in Internet Traffic, Part I: Understanding Heavy Tails (2002)
J. S. Marron, G. Samorodnitsky, F. D. Smith
This paper is part of a larger paper that studies tails of the duration distribution of Internet data flows, and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme...
Mice and Elephants Visualization of Internet (2002)
J. S. Marron, Felix Hernandez-campos, F. D. Smith
Internet tra#c is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed....
Variable Heavy Tailed Durations in Internet Traffic, (2002)
J. S. Marron, G. Samorodnitsky, F. D. Smith
This paper is part of a larger paper that studies tails of the duration distribution of Internet data flows, and their "heaviness". Data analysis motivates the concepts of moderate, far and extreme...
Distance Weighted Discrimination (2002)
High Dimension Low Sample Size statistical analysis is becoming increasingly important in a wide range of applied contexts. In such situations, it is seen that the appealing discrimination method...
Mice and Elephants Visualization of Internet Traffic (2002)
J. S. Marron, Felix Hernandez-campos, F. D. Smith
Internet traffic is composed of flows, sets of packets being transferred from one computer to another. Some visualizations for understanding the set of flows at a busy internet link are developed....
Variable heavy tailed durations in internet traffic (2002)
Hernandez-Campos, F., Marron, J. S., Samorodnitsky, G., Smith, F. D.
Variable heavy tailed durations in internet traffic
Variable heavy tailed durations in internet traffic (2002)
Hernandez-Campos, F., Marron, J. S., Samorodnitsky, G., Smith, F. D.
Variable heavy tailed durations in internet traffic
Curvature vs. Slope Inference for Features in Nonparametric Curve Estimates (2002)
Curvature vs. Slope Inference for Features in Nonparametric Curve Estimates
A SiZer analysis of IP Flow start times (2002)
Marron, J. S., Hernandez-Campos, F., Smith, F. D.
A SiZer analysis of IP Flow start times
Curvature vs. Slope Inference for Features in Nonparametric Curve Estimates (2002)
Curvature vs. Slope Inference for Features in Nonparametric Curve Estimates
A SiZer analysis of IP Flow start times (2002)
Marron, J. S., Hernandez-Campos, F., Smith, F. D.
A SiZer analysis of IP Flow start times
On the Modified Likelihood for Density Estimation. (2002)
It is shown that some recent results of Wong (1983) concerning his version of the modified likelihood criterion for smoothing parameter selection in kernel density estimation can be very misleading,...
Keywords: Windows; Kernel density estimates; Asymptotic properties; Distribution functions; Nonparametric statistics; Random variables; Stochastic processes.
The Amount of Noise Inherent in Bandwidth Selection for a Kernel Density Estimator. (2002)
Any practical method of constructing a bandwidth must depend only on a statistical sample, and should produce some sort of estimate of this bandwidth. The purpose of this paper is to show that there...
This paper makes two important contributions to the theory of bandwidth selection for kernel density estimators under right censorship. First, an asymptotic representation of the integrated squared...
Log-normal Durations Can Give Long Range Dependence (2001)
Hannig, J., Marron, J. S., Samorodnitsky, G., Smith, F. D.
Log-normal Durations Can Give Long Range Dependence
Log-normal Durations Can Give Long Range Dependence (2001)
Hannig, J., Marron, J. S., Samorodnitsky, G., Smith, F. D.
Log-normal Durations Can Give Long Range Dependence
A General Projection Framework for Constrained Smoothing (2001)
Mammen, E., Marron, J. S., Turlach, B. A., Wand, M. P.
There are a wide array of smoothing methods available for finding structure in data. A general framework is developed which shows that many of these can be viewed as a projection of the data, with...
Intuitive, Localized Analysis of Shape Variability (2001)
Paul Yushkevich, Stephen M. Pizer, Sarang Joshi, J. S. Marron
. Analysis of shape variability is important for diagnostic classification and understanding of biological processes. We present a novel shape analysis approach based on a multiscale medial...
Intuitive, Localized Analysis of Shape Variability (2000)
Paul Yushkevich, Stephen M. Pizer, Sarang Joshi, J. S. Marron
Analysis of shape variability is important for diagnostic classification and understanding of biological processes. We present a novel shape analysis approach based on a multiscale medial...
Scale space view of curve estimation (2000)
Chaudhuri, Probal, Marron, J. S.
Scale space theory from computer vision leads to an interesting and novel approach to nonparametric curve estimation. The family of smooth curve estimates indexed by the smoothing parameter can be...
Does Code Decay? Assessing the Evidence from Change Management Data (2000)
Stephen G. Eick, Todd L. Graves, Alan F. Karr, J. S. Marron, Audris Mockus
A central feature of the evolution of large software systems is that change --- which is necessary to add new functionality, accommodate new hardware and repair faults --- becomes increasingly...
Interactive Local Bandwidth Choice (1999)
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...
Does Code Decay? Assessing the Evidence from Change Management Data (1999)
Stephen G. Eick, Todd L. Graves, Alan F. Karr, J. S. Marron, Audris Mockus
A central feature of the evolution of large software systems is that change --- which is necessary to add new functionality, accommodate new hardware and repair faults --- becomes increasingly...
Does Code Decay? Assessing the Evidence from Change Management Data (1999)
Stephen G. Eick, Todd L. Graves, Alan F. Karr, J. S. Marron, Audris Mockus
A central feature of the evolution of large software systems is that change --- which is necessary to add new functionality, accommodate new hardware and repair faults --- becomes increasingly...
Predicting Fault Incidence Using Software Change History (1999)
Todd L. Graves, Alan F. Karr, J. S. Marron, Harvey Siy
This paper is an attempt to understand the processes by which software ages. We de#ne code to be aged or decayed if its structure makes it unnecessarily di#cult to understand or change, and we...
Predicting Fault Incidence Using Software Change History (1999)
Todd L. Graves, Alan F. Karr, J. S. Marron, Harvey Siy
This paper is an attempt to understand the processes by which software ages. We define code to be aged or decayed if its structure makes it unnecessarily difficult to understand or change, and we...
Connected Teaching of Statistics (1999)
Härdle, Wolfgang, Klinke, Sigbert, Marron, J. S.
Statistics is considered to be a difficult science since it requires a variety of skills including handling of quantitative data, graphical insights as well as mathematical ability. Yet ever...
Connected Teaching of Statistics (1999)
Härdle, Wolfgang, Klinke, Sigbert, Marron, J. S.
Statistics is considered to be a difficult science since it requires a variety of skills including handling of quantitative data, graphical insights as well as mathematical ability. Yet ever...
Local Polynomial Smoothing Under Qualitative Constraints (1999)
J. S. Marron, B. A. Turlach, M. P. Wand
Some nonparametric regression settings involve auxiliary information, e.g., the support function of a convex set is characterized by the fact that the sum with its second derivative is non negative....
Local Polynomial Smoothing Under Qualitative Constraints (1999)
J. S. Marron, B. A. Turlach, M. P. Wand
Some nonparametric regression settings involve auxiliary information, e.g., the support function of a convex set is characterized by the fact that the sum with its second derivative is non negative....
Robust principal component analysis for functional data (1999)
Locantore, N., Marron, J.S., Simpson, D.G., Tripoli, N., Zhang, J.T., Cohen, K.L.
Robust principal component analysis for functional data (1999)
Locantore, N., Marron, J.S., Simpson, D.G., Tripoli, N., Zhang, J.T., Cohen, K.L.
Interactive Local Bandwidth Choice (1998)
A tool for user choice of the local bandwidth function for kernel density and nonparametric regression estimates is developed using KDE, a graphical object-oriented package for interactive kernel...
Interactive Local Bandwidth Choice (1998)
A tool for user choice of the local bandwidth function for kernel density and nonparametric regression estimates is developed using KDE, a graphical object-oriented package for interactive kernel...
Interactive Local Bandwidth Choice (1998)
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...
Interactive Local Bandwidth Choice (1998)
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...
Interactive Local Bandwidth Choice (1998)
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...
Significance of Features via SiZer (1998)
SiZer is an exploratory data analysis tool, that works in conjuction with smoothing methods. It addresses the issue that is often central to the use of smoothing in data analysis: which observed...
SiZer for exploration of structures in curves (1998)
In the use of smoothing methods in data analysis, an important question is often: which observed features are "really there?", as opposed to being spurious sampling artifacts. An approach is...
SiZer for exploration of structures in curves (1998)
In the use of smoothing methods in data analysis, an important question is often: which observed features are "really there?", as opposed to being spurious sampling artifacts. An approach is...
SiZer for exploration of structures in curves (1998)
In the use of smoothing methods in data analysis, an important question is often: which observed features are "really there?", as opposed to being spurious sampling artifacts. An approach is...
On automatic boundary corrections (1997)
Cheng, Ming-Yen, Fan, Jianqing, Marron, J. S.
Many popular curve estimators based on smoothing have difficulties caused by boundary effects. These effects are visually disturbing in practice and can play a dominant role in theoretical analysis....
Curve estimation when the design density is low (1997)
Hall, Peter, Marron, J. S., Neumann, M. H., Titterington, D. M.
In problems where a high-dimensional design is projected into a lower number of dimensions, the density of the new design is typically not bounded away from zero over its support, even if the...
On Automatic Boundary Corrections (1996)
Ming-yen Cheng, Jianqing Fan, J. S. Marron
Many popular curve estimators based on smoothing have difficulties caused by boundary effects. These effects are visually disturbing in practice and can play a dominant role in theoretical analysis....
Interactive Local Bandwidth Choice (1995)
Marron, J. S., Udina I Abelló, Frederic
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...
W. Hardle, J. S. Marron, L. Yang, Wirtschaftswissenschaftliche Fakultat
chaft. 1 Interpretability Many statisticians view simplicity and intuitive understanding of "what the smooth is doing to the data", as very important criteria in choosing a smoothing method. In this...
Repeated observation of breast tumor subtypes in independent gene expression data sets
Sørlie, Therese, Tibshirani, Robert, Parker, Joel, Hastie, Trevor, Marron, J. S., Nobel, Andrew, ...
Characteristic patterns of gene expression measured by DNA microarrays have been used to classify tumors into clinically relevant subgroups. In this study, we have refined the previously defined...
Repeated observation of breast tumor subtypes in independent gene expression data sets
Sørlie, Therese, Tibshirani, Robert, Parker, Joel, Hastie, Trevor, Marron, J. S., Nobel, Andrew, ...
Characteristic patterns of gene expression measured by DNA microarrays have been used to classify tumors into clinically relevant subgroups. In this study, we have refined the previously defined...
Geometric representation of high dimension, low sample size data
Peter Hall, J. S. Marron, Amnon Neeman
High dimension, low sample size data are emerging in various areas of science. We find a common structure underlying many such data sets by using a non-standard type of asymptotics: the dimension...
Visualization and inference based on wavelet coefficients, SiZer and SiNos
Park, Cheolwoo, Godtliebsen, Fred, Taqqu, Murad, Stoev, Stilian, Marron, J.S.
Partitioned cross-validation is proposed as a method for overcoming the large amounts of across sample variability to which ordinary cross-validation is subject. The price for cutting down on the...
Bandwidth choice for average derivative estimation
Haerdle,W., Hart,J.D., Marron,J.S., Tsybakov,A.B.
Average derivative,Bandwidth,Kernel estimators
Local minima in cross validation functions
Bandwidth selection,Cross validation,Kernel density estimators,Local minima,Smoothing parameter selection
Comparision of data-driven bandwidth selectors
Cross-validation,Data driven bandwidth selection,Density estimation,Kernel estimators,Plug-in method
Bootstrap simultaneous error bars for nonparametric regression
Bootstrap,Error Bars,Kernel smoothing,Nonparametric regression,Variability Bound
LASS: a tool for the local analysis of self-similarity
Stoev, Stilian, Taqqu, Murad S., Park, Cheolwoo, Michailidis, George, Marron, J.S.
BOOTSTRAP SIMULTANEOUS ERROR BARS FOR NONPARAMETRIC REGRESSION.
evaluation ; statistics ; econometric models
Asymptotically Best Bandwidth Selectors in Kernel Density Estimation.
Park, B.U., Kim, W.C., Marron, J.S.
information ; statistical analysis ; econometrics
Dependent SiZer: Goodness-of-Fit Tests for Time Series Models
Cheolwoo Park, J. S. Marron, Vitaliana Rondonotti
In this paper, we extend SiZer (SIgnificant ZERo crossing of the derivatives) to dependent data for the purpose of goodness-of-fit tests for time series models. Dependent SiZer compares the observed...
Interactive Local Bandwidth Choice
A tool for user choice of the local bandwidth function for a kernel density estimate is developed using KDE, a graphical object-oriented package for interactive kernel density estimation written in...