Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification

Li, Yu-Yang; Bai, Yu; Wang, Cunshi; Qu, Mengwei; Lu, Ziteng; Soria, Roberto; Liu, Jifeng

(Submitted on 16 Apr 2024)

Abstract: Light curves serve as a valuable source of information on stellar formation and evolution. With the rapid advancement of machine learning techniques, they can be effectively processed to extract astronomical patterns and information. In this study, we present a comprehensive evaluation of deep-learning and large language model (LLM) based models for the automatic classification of variable star light curves, based on large datasets from the Kepler and K2 missions. Special emphasis is placed on Cepheids, RR Lyrae, and eclipsing binaries, examining the influence of observational cadence and phase distribution on classification precision. Employing AutoDL optimization, we achieve striking performance with the 1D-Convolution+BiLSTM architecture and the Swin Transformer, reaching accuracies of 94% and 99%, respectively, with the latter demonstrating a notable 83% accuracy in discerning the elusive Type II Cepheids, which comprise merely 0.02% of the total dataset. We unveil StarWhisper LightCurve (LC), an innovative series comprising three LLM-based models: a large language model (LLM), a multimodal large language model (MLLM), and a large audio language model (LALM). Each model is fine-tuned with strategic prompt engineering and customized training methods to explore the emergent abilities of these models for astronomical data. Remarkably, the StarWhisper LC series exhibits high accuracies of around 90%, significantly reducing the need for explicit feature engineering, thereby paving the way for streamlined parallel data processing and the progression of multifaceted multimodal models in astronomical applications. The study furnishes two detailed catalogs illustrating the impacts of phase and sampling intervals on deep learning classification accuracy, showing that a substantial decrease of up to 14% in observation duration and 21% in sampling points can be realized without compromising accuracy by more than 10%.
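
To make the two reductions mentioned at the end of the abstract concrete, the following is a minimal sketch of what shortening the observation duration by 14% and removing 21% of the sampling points could look like on a toy light curve. The abstract does not describe how the authors performed these cuts, so the helper names `truncate_duration` and `thin_samples`, the choice to drop the trailing part of the baseline, and the evenly spaced thinning are all assumptions made for illustration only.

```python
import math


def truncate_duration(times, fluxes, fraction):
    """Drop the trailing `fraction` of the observing baseline.

    Hypothetical helper: assumes `times` is sorted in ascending order.
    """
    start, end = times[0], times[-1]
    cutoff = end - fraction * (end - start)
    kept = [(t, f) for t, f in zip(times, fluxes) if t <= cutoff]
    return [t for t, _ in kept], [f for _, f in kept]


def thin_samples(times, fluxes, fraction):
    """Remove roughly `fraction` of the points, keeping evenly spaced ones.

    Hypothetical helper: the first and last samples are always kept.
    """
    n = len(times)
    n_keep = max(2, round(n * (1 - fraction)))
    # Integer arithmetic gives n_keep distinct, evenly spread indices.
    idx = [i * (n - 1) // (n_keep - 1) for i in range(n_keep)]
    return [times[i] for i in idx], [fluxes[i] for i in idx]


# Toy light curve: 100 samples, one per day, with a sinusoidal signal.
times = [float(i) for i in range(100)]
fluxes = [math.sin(2 * math.pi * t / 10.0) for t in times]

short_t, short_f = truncate_duration(times, fluxes, 0.14)  # 14% shorter baseline
thin_t, thin_f = thin_samples(times, fluxes, 0.21)         # 21% fewer points
```

In an experiment of the kind the abstract describes, a classifier would then be evaluated on the reduced light curves and its accuracy compared with the accuracy on the full ones.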

Comments:	35 pages, 20 figures
Subjects:	Instrumentation and Methods for Astrophysics (astro-ph.IM); Solar and Stellar Astrophysics (astro-ph.SR); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2404.10757 [astro-ph.IM]
	(or arXiv:2404.10757v1 [astro-ph.IM] for this version)

Submission history

From: Yuyang Li
[v1] Tue, 16 Apr 2024 17:35:25 GMT (14336kb,D)
