Deep Learning

My current research interests are in the application of deep learning techniques to neutrino detectors. Traditional pattern recognition methods applied to the images from modern neutrino detectors typically perform poorly on complicated and busy neutrino events.

The application of deep neural networks to neutrino data is quite new in the last several years, in particular the NOvA experiment and the Microboone experiments are among the first neutrino experiments to adopt deep learning into their analysis.

As a member of Microboone, I am also a leader of the DeepLearnPhysics group which aims to provide guidance, tools, and expertise to students and researchers who want to apply deep learning to their difficult pattern recognition problems. We provide open source software development including a data format for fast, threaded IO of sparse particle physics data, visualization tools, and many implementations of important deep convolutional networks - see our github as well as our many tutorials and open data sets.

Personally, my application of deep learning has been to address some of the hardest problems we face in my research with tools that can be deployed at large scale over millions of neutrino interaction candidates. One of the projects I am working on currently is cosmogenic background tagging with semantic segmentation, in the SBND experiment.

The interactions of the SBND detector are flooded with cosmogenic muons that rain down upon the detector. In particular, because the detector is so slow to read out an interaction, a neutrino can be lost in all of the cosmic rays that also enter the event. Below, you can see the three different projections of a simulated event with a neutrino and several cosmic muons.

The first view of a muon neutrino interaction with cosmic muons overlaid. The second view of a muon neutrino interaction with cosmic muons overlaid. The third view of a muon neutrino interaction with cosmic muons overlaid.

To remove the problem of overlaid cosmics, I developed a deep neural network to label each pixel as either background, neutrino induced, or cosmic induced. The network is based off of U-Net and ResNet, though it offers a unique feature for this problem. Because the neutrino detector gives three projections of the same interaction, it’s possible for the network to learn cross plane correlations and infer geometrical properties of the interactions. For example, cosmic particle enter from the edges of the detector, and the neutrino interaction starts within the detector far from the edges. In 3D, looking at the simluation information, this is more clear:

3D view of neutrino interaction (towards the right) versus cosmic interactions.

The network architecture is then to perform traditional U-Net style upsampling and downsampling, but at the deepest, most abstracted layers the filters from each plane are concatenated together:

Cartoonish diagram of network design, with only two planes.

As an example of performance, you can see the output on the three images from above, where just the pixels labeled as neutrino have been kept:

The first view of a muon neutrino interaction with cosmic muons removed. The second view of a muon neutrino interaction with cosmic muons removed. The third view of a muon neutrino interaction with cosmic muons removed.

This work is currently being prepared for publication.