Astrophysics & AI with Python: Unlocking the Universe with Astroquery


[ARTICLE · art-28492] src=dev.to ↗ pub=2026-06-15T20:00Z topic=artificial-intelligence verified=true sentiment=↑ positive

A developer demonstrates how to use the Python library Astroquery to programmatically access astronomical data from multiple archives, solving the heterogeneity problem of different APIs. The tutorial shows how to resolve coordinates for the Andromeda Galaxy using NED and query the MAST archive for Hubble Space Telescope observations, integrating with Astropy for unit handling and coordinate transformations.

Published Jun 15, 2026 · 5 min read

The universe is no longer just observed through a physical telescope eyepiece; it is read, parsed, and analyzed through code. For the modern data-driven astronomer, the sky is a massive, distributed database. However, accessing this data presents a unique challenge: the "Babel of Archives."

How do you programmatically search the accumulated knowledge of humanity when that knowledge is scattered across dozens of independent institutions, each with its own proprietary query language, format, and API?

The answer is Astroquery. This powerful Python library serves as the universal translator for the Virtual Observatory, turning complex web requests into simple function calls. In this guide, we will explore the theoretical foundations of this tool and walk through a practical script to fetch Hubble Space Telescope data for the Andromeda Galaxy.

Modern astronomy is defined by the data deluge. From the Hubble Space Telescope (HST) to the James Webb Space Telescope (JWST) and the Gaia mission, we are collecting petabytes of data. But this data isn't stored on a single central server. It is housed in specialized archives, each run by a different institution.

If you wanted to find all data on M31, you would historically need to write a custom API wrapper for each archive. This is the Heterogeneity Problem.

Think of astroquery as a Universal Research Librarian. You give it a simple instruction in Python, and it performs the complex, hidden work behind the scenes.

Crucially, astroquery integrates tightly with astropy.coordinates. Because queries accept SkyCoord objects, unit conversions and reference frame transformations (like precessing coordinates from J2000 to the current epoch) are handled by Astropy rather than by hand, eliminating a massive source of error in scientific research.

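Let’s put theory into practice. In this example, we will perform the standard two-step astronomical query: first resolve the target name to sky coordinates with NED, then search MAST for HST observations around that position.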

import sys

import astropy.units as u
from astropy.coordinates import SkyCoord
from astroquery.ipac.ned import Ned  # located at astroquery.ned in older releases
from astroquery.mast import Observations  # observation searches live on Observations, not Mast

TARGET_NAME = "M31"

print(f"--- 1. Resolving Coordinates for {TARGET_NAME} using NED ---")

try:
    ned_result_table = Ned.query_object(TARGET_NAME)
except Exception as e:
    print(f"Error querying NED for {TARGET_NAME}: {e}")
    sys.exit(1)

if len(ned_result_table) == 0:
    print(f"Error: NED returned an empty result for {TARGET_NAME}.")
    sys.exit(1)

# The coordinate columns are named 'RA'/'DEC' or 'RA(deg)'/'DEC(deg)'
# depending on the astroquery release; both hold decimal degrees.
ra_col = "RA" if "RA" in ned_result_table.colnames else "RA(deg)"
dec_col = "DEC" if "DEC" in ned_result_table.colnames else "DEC(deg)"
ra_deg = float(ned_result_table[ra_col][0])
dec_deg = float(ned_result_table[dec_col][0])

# Wrap the raw numbers in a unit-aware, frame-aware coordinate object.
target_coord = SkyCoord(
    ra=ra_deg * u.degree,
    dec=dec_deg * u.degree,
    frame='icrs'
)

print(f"Resolved Coordinates: RA={target_coord.ra.deg:.4f} deg, Dec={target_coord.dec.deg:.4f} deg")

search_radius = 0.5 * u.degree

print(f"\n--- 2. Querying MAST for HST Observations within {search_radius} of {TARGET_NAME} ---")

# Cone search around the target, restricted to one mission.
mast_observations = Observations.query_criteria(
    coordinates=target_coord,
    radius=search_radius,
    obs_collection="HST"  # Filter for Hubble data only
)

if mast_observations is not None and len(mast_observations) > 0:
    print(f"\nSuccess! Found {len(mast_observations)} HST observations.")
    print("\nMetadata Summary (First 5 entries):")
    summary_data = mast_observations[['obsid', 'instrument_name', 't_exptime', 'filters']][:5]
    print(summary_data)
else:
    print("\nNo HST observations found.")

print("\nQuery process complete.")

We import astropy.units (aliased as u) and SkyCoord. In modern astronomical coding, units are mandatory. Passing a raw number like 0.5 is dangerous: is that 0.5 degrees, radians, or arcseconds? By multiplying 0.5 * u.degree, we create a unit-aware object that astroquery can interpret unambiguously.

The function Ned.query_object("M31") sends a request to the NASA/IPAC Extragalactic Database. It returns an Astropy Table containing metadata (redshift, object type, etc.). We extract the right ascension and declination columns, named RA(deg) and DEC(deg) here (some astroquery releases label them RA and DEC instead).

We index with [0] because even a single name query returns a table (a list of rows). We grab the first row as the primary match.

We wrap the raw numbers into target_coord = SkyCoord(...). This object is the currency of the Astropy ecosystem. It carries not just the numbers, but the units (u.degree) and the frame (icrs, the International Celestial Reference System).

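We use query_criteria() from astroquery.mast, the Swiss Army knife of MAST queries. Observation searches are exposed on the Observations class; the lower-level Mast class is meant for direct service requests.

coordinates=target_coord: the center of the search, given as a SkyCoord.

radius=search_radius: the angular radius of the cone around that center.

obs_collection="HST": restricts the results to Hubble observations.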

The result is an Astropy Table. For astronomy this has an advantage over a standard Pandas DataFrame: columns can carry units, and the table can carry metadata alongside the values. We slice the table to show the first 5 entries and specific columns (obsid, instrument_name, t_exptime, filters) to keep the output readable.

The most common error for beginners is forgetting astropy.units.

Incorrect:

search_radius = 0.5 # Just a float

Correct:

search_radius = 0.5 * u.degree # A physical quantity

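If you pass a bare number, astroquery cannot know which unit you meant. Depending on the service, it may raise an error or silently assume a default unit, and the silent case is the more dangerous one. Always use units!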

astroquery is more than a convenience wrapper; it is the glue that holds the fragmented world of astronomical archives together. By abstracting away the complexities of HTTP requests, XML parsing, and coordinate transformations, it allows researchers to focus on the science rather than the plumbing.

Whether you are building a training set for an AI model or analyzing the spectral energy distribution of a galaxy, astroquery provides the standardized, programmatic access required for reproducible, modern science.

How would you use astroquery to programmatically curate a balanced training dataset of spiral vs. elliptical galaxies?

The concepts and code demonstrated here are drawn directly from the comprehensive roadmap laid out in the ebook Astrophysics & AI: Building Research Agents for Astronomy, Cosmology, and SETI.


Read original on dev.to → dev.to/programmingcentral/astrophysics-ai-with-p…
