{"id":1125597,"date":"2024-05-31T05:50:54","date_gmt":"2024-05-31T09:50:54","guid":{"rendered":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/uncategorized\/astronomy-generates-mountains-of-data-thats-perfect-for-ai-universe-today\/"},"modified":"2024-05-31T05:50:54","modified_gmt":"2024-05-31T09:50:54","slug":"astronomy-generates-mountains-of-data-thats-perfect-for-ai-universe-today","status":"publish","type":"post","link":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/astronomy\/astronomy-generates-mountains-of-data-thats-perfect-for-ai-universe-today\/","title":{"rendered":"Astronomy Generates Mountains of Data. That&#8217;s Perfect for AI &#8211; Universe Today"},"content":{"rendered":"<p><p>    Consumer-grade AI is finding its way into peoples daily lives    with its ability to generate text and images and automate    tasks. But astronomers need much more powerful, specialized AI.    The vast amounts of observational data generated by modern    telescopes and observatories defies astronomers efforts to    extract all of its meaning.  <\/p>\n<p>    A team of scientists is developing a new AI for astronomical    data called AstroPT. Theyve presented it in a new paper titled    AstroPT: Scaling    Large Observation Models for Astronomy. The paper is    available at arxiv.org, and the lead author is Michael J.    Smith, a data scientist and astronomer from Aspia Space.  <\/p>\n<p>    Astronomers are facing a growing deluge of data, which will    expand enormously when the     Vera Rubin Observatory (VRO) comes online in 2025. The VRO    has the worlds largest camera, and each of its images could    fill 1500 large-screen TVs. During its ten-year mission, the    VRO will generate about 0.5 exabytes of data, which is about    50,000 times more data than is contained in the USAs Library    of Congress.  <\/p>\n<p>    Other telescopes with enormous mirrors are also approaching    first light. The Giant Magellan Telescope, the Thirty Meter    Telescope, and the European Extremely Large Telescope combined    will generate an overwhelming amount of data.  <\/p>\n<p>    Having data that cant be processed is the same as not having    the data at all. Its basically inert and has no meaning until    its processed somehow. When you have too much data, and you    dont have the technology to process it, its like having no    data,     said Cecilia Garraffo, a computational astrophysicist at    the Harvard-Smithsonian Center for Astrophysics.  <\/p>\n<p>    This is where AstroPT comes in.  <\/p>\n<p>    AstroPT stands for Astro Pretrained Transformer, where a    transformer is a particular type of AI. Transformers can change    or transform an input sequence into an output sequence. AI    needs to be trained, and AstroPT has been trained on 8.6    million 512 x 512-pixel images from the DESI Legacy    Survey Data Release 8. DESI is the Dark Energy    Spectroscopic Instrument. DESI studies the effect of Dark    Energy by capturing the optical spectra from tens of millions    of galaxies and quasars.  <\/p>\n<p>    AstroPT and similar AI deal with tokens. Tokens are visual    elements in a larger image that contain meaning. By breaking    images down into tokens, an AI can understand the larger    meaning of an image. AstroPT can transform individual tokens    into coherent output.  <\/p>\n<p>    AstroPT has been trained on visual tokens. The idea is to teach    the AI to predict the next token. The more thoroughly its been    trained to do that, the better it will perform.  <\/p>\n<p>    We demonstrated that simple generative autoregressive models    can learn scientifically useful information when pre-trained on    the surrogate task of predicting the next 16  16 pixel patch    in a sequence of galaxy image patches, the authors write. In    this scheme, each image patch is a token.  <\/p>\n<p>    One of the obstacles to training AI like AstroPT concerns what    AI scientists call the token crisis. To be effective, AI    needs to be trained on a large number of quality tokens. In a    2023 paper, a    separate team of researchers explained that a lack of tokens    can limit the effectiveness of some AI, such as LLMs or Large    Language Models. State-of-the-art LLMs require vast amounts of    internet-scale text data for pre-training, the wrote.    Unfortunately,  the growth rate of high-quality text data on    the internet is much    slower than the growth rate of data required by LLMs.  <\/p>\n<p>    AstroPT faces the same problem: a dearth of quality tokens to    train on. Like other AI, it uses LOMs or Large Observation    Models. The team says their results so far suggest that AstroPT    can solve the token crisis by using data from observations.    This is a promising result that suggests that data taken from    the observational sciences would complement data from other    domains when used to pre-train a single multimodal LOM, and so    points towards the use of observational data as one solution to    the token crisis.  <\/p>\n<p>    AI developers are eager to find solutions to the token crisis    and other AI challenges.  <\/p>\n<p>    Without better AI, a data processing bottleneck will prevent    astronomers and astrophysicists from making discoveries from    the vast quantities of data that will soon arrive. Can AstroPT    help?  <\/p>\n<p>    The authors are hoping that it can, but it needs much more    development. They say theyre open to collaborating with others    to strengthen AstroPT. To aid that, they followed current    leading community models as closely as possible. They call it    an open to all project.  <\/p>\n<p>    We took these decisions in the belief that collaborative    community development paves the fastest route towards realising    an open source web-scale large observation model, they write.  <\/p>\n<p>    We warmly invite potential collaborators to join us, they    conclude.  <\/p>\n<p>    Itll be interesting to see how AI developers will keep up with    the vast amount of astronomical data coming our way.  <\/p>\n<p>      Like Loading...    <\/p>\n<p><!-- Auto Generated --><\/p>\n<p>Original post: <\/p>\n<p><a target=\"_blank\" rel=\"nofollow noopener\" href=\"https:\/\/www.universetoday.com\/167153\/astronomy-generates-mountains-of-data-thats-perfect-for-ai\/\" title=\"Astronomy Generates Mountains of Data. That's Perfect for AI - Universe Today\">Astronomy Generates Mountains of Data. That's Perfect for AI - Universe Today<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p> Consumer-grade AI is finding its way into peoples daily lives with its ability to generate text and images and automate tasks. But astronomers need much more powerful, specialized AI <a href=\"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/astronomy\/astronomy-generates-mountains-of-data-thats-perfect-for-ai-universe-today\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[257798],"tags":[],"class_list":["post-1125597","post","type-post","status-publish","format-standard","hentry","category-astronomy"],"_links":{"self":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1125597"}],"collection":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/comments?post=1125597"}],"version-history":[{"count":0,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/posts\/1125597\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/media?parent=1125597"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/categories?post=1125597"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.euvolution.com\/prometheism-transhumanism-posthumanism\/wp-json\/wp\/v2\/tags?post=1125597"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}