Marvin: A Toolkit for Streamlined Access and Visualization of the SDSS-IV MaNGA Data Set
The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, one of three core programs of the fourth-generation Sloan Digital Sky Survey (SDSS-IV), is producing a massive, high-dimensional integral field spectroscopic data set. However, leveraging the MaNGA data set to address key questi...
Saved in:
| Published in | arXiv.org |
|---|---|
| Main Authors | , , , , , , , , , , , , , |
| Format | Paper Journal Article |
| Language | English |
| Published |
Ithaca
Cornell University Library, arXiv.org
06.12.2018
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 2331-8422 |
| DOI | 10.48550/arxiv.1812.03833 |
Cover
| Abstract | The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, one of three core programs of the fourth-generation Sloan Digital Sky Survey (SDSS-IV), is producing a massive, high-dimensional integral field spectroscopic data set. However, leveraging the MaNGA data set to address key questions about galaxy formation presents serious data-related challenges due to the combination of its spatially inter-connected measurements and sheer volume. For each galaxy, the MaNGA pipelines produce relatively large data files to preserve the spatial correlations of the spectra and measurements, but this comes at the expense of storing the data set in a coarsely-chunked manner. The coarse chunking and total volume of the data make it time-consuming to download and curate locally-stored data. Thus, accessing, querying, visually exploring, and performing statistical analyses across the whole data set at a fine-grained scale is extremely challenging using just FITS files. To overcome these challenges, we have developed \marvin: a toolkit consisting of a Python package, Application Programming Interface (API), and web application utilizing a remote database. \marvin's robust and sustainable design minimizes maintenance, while facilitating user-contributed extensions such as high level analysis code. Finally, we are in the process of abstracting out \marvin's core functionality into a separate product so that it can serve as a foundation for others to develop \marvin-like systems for new science applications. |
|---|---|
| AbstractList | The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, one of three core programs of the fourth-generation Sloan Digital Sky Survey (SDSS-IV), is producing a massive, high-dimensional integral field spectroscopic data set. However, leveraging the MaNGA data set to address key questions about galaxy formation presents serious data-related challenges due to the combination of its spatially inter-connected measurements and sheer volume. For each galaxy, the MaNGA pipelines produce relatively large data files to preserve the spatial correlations of the spectra and measurements, but this comes at the expense of storing the data set in a coarsely-chunked manner. The coarse chunking and total volume of the data make it time-consuming to download and curate locally-stored data. Thus, accessing, querying, visually exploring, and performing statistical analyses across the whole data set at a fine-grained scale is extremely challenging using just FITS files. To overcome these challenges, we have developed a toolkit consisting of a Python package, Application Programming Interface (API), and web application utilizing a remote database. 's robust and sustainable design minimizes maintenance, while facilitating user-contributed extensions such as high level analysis code. Finally, we are in the process of abstracting out 's core functionality into a separate product so that it can serve as a foundation for others to develop -like systems for new science applications. The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, one of three core programs of the fourth-generation Sloan Digital Sky Survey (SDSS-IV), is producing a massive, high-dimensional integral field spectroscopic data set. However, leveraging the MaNGA data set to address key questions about galaxy formation presents serious data-related challenges due to the combination of its spatially inter-connected measurements and sheer volume. For each galaxy, the MaNGA pipelines produce relatively large data files to preserve the spatial correlations of the spectra and measurements, but this comes at the expense of storing the data set in a coarsely-chunked manner. The coarse chunking and total volume of the data make it time-consuming to download and curate locally-stored data. Thus, accessing, querying, visually exploring, and performing statistical analyses across the whole data set at a fine-grained scale is extremely challenging using just FITS files. To overcome these challenges, we have developed \marvin: a toolkit consisting of a Python package, Application Programming Interface (API), and web application utilizing a remote database. \marvin's robust and sustainable design minimizes maintenance, while facilitating user-contributed extensions such as high level analysis code. Finally, we are in the process of abstracting out \marvin's core functionality into a separate product so that it can serve as a foundation for others to develop \marvin-like systems for new science applications. |
| Author | Andrews, Brett H Argudo-Fernández, María Cherinka, Brian Blanton, Michael Westfall, Kyle Rowlands, Kate Brownstein, Joel Weijmans, Anne-Marie Yan, Renbin Jones, Amy Bundy, Kevin Sánchez-Gallego, José Law, David R Masters, Karen |
| Author_xml | – sequence: 1 givenname: Brian surname: Cherinka fullname: Cherinka, Brian – sequence: 2 givenname: Brett surname: Andrews middlename: H fullname: Andrews, Brett H – sequence: 3 givenname: José surname: Sánchez-Gallego fullname: Sánchez-Gallego, José – sequence: 4 givenname: Joel surname: Brownstein fullname: Brownstein, Joel – sequence: 5 givenname: María surname: Argudo-Fernández fullname: Argudo-Fernández, María – sequence: 6 givenname: Michael surname: Blanton fullname: Blanton, Michael – sequence: 7 givenname: Kevin surname: Bundy fullname: Bundy, Kevin – sequence: 8 givenname: Amy surname: Jones fullname: Jones, Amy – sequence: 9 givenname: Karen surname: Masters fullname: Masters, Karen – sequence: 10 givenname: David surname: Law middlename: R fullname: Law, David R – sequence: 11 givenname: Kate surname: Rowlands fullname: Rowlands, Kate – sequence: 12 givenname: Anne-Marie surname: Weijmans fullname: Weijmans, Anne-Marie – sequence: 13 givenname: Kyle surname: Westfall fullname: Westfall, Kyle – sequence: 14 givenname: Renbin surname: Yan fullname: Yan, Renbin |
| BackLink | https://doi.org/10.48550/arXiv.1812.03833$$DView paper in arXiv https://doi.org/10.3847/1538-3881/ab2634$$DView published paper (Access to full text may be restricted) |
| BookMark | eNotj8FOwkAURSdGExH5AFdO4ro4ndfXKe4aUCQBXZQQd81rO42DpYPTgahfL4Kruzm5OeeKnbe21YzdhGIYJYjintyX2Q_DJJRDAQnAGetJgDBIIikv2aDr1kIIGSuJCD32tiC3N-0DT_nS2ubDeF5bxzPvNG0a0-qKp2Wpu45TW_GV6XbUmB_yxrbc1ty_a55NsiyYrfiCXqYpn5Annml_zS5qajo9-N8-Wz49LsfPwfx1Ohun84BQiqDEuqwwqVUpIoVCRQiRgjpUsoxRUVFUoxhljJWoJWAx0gXFAkgigNAjCKHPbk-3x-x868yG3Hf-l58f8w_E3YnYOvu5053P13bn2oNTLkOMIlQqEfALzu1cIA |
| ContentType | Paper Journal Article |
| Copyright | 2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| Copyright_xml | – notice: 2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
| DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS GOX |
| DOI | 10.48550/arxiv.1812.03833 |
| DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials Local Electronic Collection Information ProQuest Central Technology Collection ProQuest One Community College ProQuest Central Korea SciTech Premium Collection ProQuest Engineering Collection Engineering Database ProQuest Central Premium ProQuest One Academic Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Applied & Life Sciences ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection arXiv.org |
| DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest One Applied & Life Sciences ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest Central (New) ProQuest One Academic ProQuest One Academic (New) Engineering Collection |
| DatabaseTitleList | Publicly Available Content Database |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Physics |
| EISSN | 2331-8422 |
| ExternalDocumentID | 1812_03833 |
| Genre | Working Paper/Pre-Print |
| GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PHGZM PHGZT PIMPY PKEHL PQEST PQGLB PQQKQ PQUKI PRINS PTHSS GOX |
| ID | FETCH-LOGICAL-a520-c5fcd58f7c047507453473f172c657abbd965265d0f235b9eba603a25330e9313 |
| IEDL.DBID | GOX |
| IngestDate | Tue Sep 30 19:23:09 EDT 2025 Mon Jun 30 09:22:10 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a520-c5fcd58f7c047507453473f172c657abbd965265d0f235b9eba603a25330e9313 |
| Notes | SourceType-Working Papers-1 ObjectType-Working Paper/Pre-Print-1 content type line 50 |
| OpenAccessLink | https://arxiv.org/abs/1812.03833 |
| PQID | 2154457780 |
| PQPubID | 2050157 |
| ParticipantIDs | arxiv_primary_1812_03833 proquest_journals_2154457780 |
| PublicationCentury | 2000 |
| PublicationDate | 20181206 2018-12-06 |
| PublicationDateYYYYMMDD | 2018-12-06 |
| PublicationDate_xml | – month: 12 year: 2018 text: 20181206 day: 06 |
| PublicationDecade | 2010 |
| PublicationPlace | Ithaca |
| PublicationPlace_xml | – name: Ithaca |
| PublicationTitle | arXiv.org |
| PublicationYear | 2018 |
| Publisher | Cornell University Library, arXiv.org |
| Publisher_xml | – name: Cornell University Library, arXiv.org |
| SSID | ssj0002672553 |
| Score | 1.6760937 |
| SecondaryResourceType | preprint |
| Snippet | The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey, one of three core programs of the fourth-generation Sloan Digital Sky Survey (SDSS-IV),... |
| SourceID | arxiv proquest |
| SourceType | Open Access Repository Aggregation Database |
| SubjectTerms | Application programming interface Applications programs Correlation analysis Datasets Downloading Galactic evolution Galaxies Mapping Physics - Astrophysics of Galaxies Physics - Instrumentation and Methods for Astrophysics Programming languages Sky surveys (astronomy) Star & galaxy formation Statistical analysis |
| SummonAdditionalLinks | – databaseName: ProQuest Central dbid: BENPR link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3PT8IwFG4QYuLNnwFF04PXwVjXbjMxBgVEEwgRJNyW17VNiLohTOOfb1s2PZh47W5f1_e-9-t7CF0yN4kgpNyJOKeOH1DpcOCBQwXR8YhS2seYfMdozIbP_uOCLipoXM7CmLbK0iZaQy2yxOTI256VjQmC0L1ZvTtma5SprpYrNKBYrSCurcTYDqp5Rhmrimq3_fHk6Sfr4rFAc2iyLW9aMa82rL-Wny3j6FquDteIZqn26I9xth5nsI9qE1jJ9QGqyPQQ7dpGzWRzhBYj0I87vcJdPMuy15dljjXtxKa4DG-GMgrctTsQMaQCz5cbMzS5HbXEmcKa7uFpbzp1HuZ4BOP7Lu5BDngq82M0G_Rnd0On2I7gANUhX0JVImiogsTVCGsiQIkfEKX5SMJoAJyLiBnpe-Eqj1AeSQ7MJeCZblIZkQ45QdU0S2Ud4chnvlSeNBU-X3oi0jEc7QBlCjwWJqKB6haReLUVwIgNWLEFq4GaJUhx8fNv4t-rOv3_8xna0_wjtN0hrImq-fpDnmsfn_OL4uK-AbPVo44 priority: 102 providerName: ProQuest |
| Title | Marvin: A Toolkit for Streamlined Access and Visualization of the SDSS-IV MaNGA Data Set |
| URI | https://www.proquest.com/docview/2154457780 https://arxiv.org/abs/1812.03833 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV3PT4MwGP2yzYsXo1EzdS49eEUZ0ALe0P3ShGlkLtxIC22yqGA2NJ782_1atngwXjiQcnmlvPf4-r0CXDA7D3lAhRUKQS3Pp9ISXPgWLVz0I0ohx-j_HfGMTZ-9-5SmLSDbXhi--lp-NvnAYn2l6efSRhPltqGNQkE38z6kTXHSRHFtxv-OQ41pbv35tBq-GO_D3kbokaiZmQNoyfIQ0pjj0iyvSUTmVfX6sqwJikaiS8P8TQu-gkTmBEOCBp8slmvd8tg0SpJKERRrJBkmiXW3IDGfTSIy5DUniayPYD4ezW-n1uZsA4tTNGw5VXlBA-XnNuKDNE5dz3cVqomcUZ8LUYRMB9cXtnJcKkIpOLNd7ui9oDJ0B-4xdMqqlF0gocc8qRyp63OedIoQHRgdcMoUd1iQFyfQNYhk7018RabBygxYJ9DbgpRtXt115ph8Ht8P7NP_nzyDXVQOgdnXwXrQqVcf8hzZuRZ9aAfjSR92bkazx6e-mTC8xt-jHxP9kA0 |
| linkProvider | Cornell University |
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1db9MwFL0aqxC8wTa0wRh-GI_ZMjt2EqQJdXRby9ZqomXqW3Qd21IFJKUNXz-O_8a1m7IHJN72mkhRdG3fc47vF8ChisscM6mjXGsZJam0kUadRtII0iPOEcb4-47hSPU_Ju-ncroBv9e1MD6tcu0Tg6M2denvyI95aBuTpln8dv418lOjfHR1PUID29EK5jS0GGsLO67srx8k4Zangx6t92vOL84n7_pRO2UgQknSqZSuNDJzaRnTnxKgSpGkwhGul0qmqLXJlW8hb2LHhdS51ahigdxnZdpcnAj67APoJCLJSft1zs5HNx_-XvJwlRJlF6toaugddoyLn7PvRx5Xj2JSh4JIcXj0DxYEgLt4Ap0bnNvFU9iw1RY8DHmh5XIbpkMkX1K9YV02qevPn2YNI5bLfCwbv3iGalg3jFxkWBl2O1v6Gs1VZSerHSN2yca98Tga3LIhji67rIcNsrFtdmByH2Z6BptVXdldYHmiEuu49QHFxHKTk2SUJyiVQ66y0uzBbrBIMV_12yi8sYpgrD3YXxupaM_asrjbGc____oVPOpPhtfF9WB09QIeE_XJQmKK2ofNZvHNviR60eiDdhEZFPe8bf4A6I3dzg |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Marvin%3A+A+Toolkit+for+Streamlined+Access+and+Visualization+of+the+SDSS-IV+MaNGA+Data+Set&rft.jtitle=arXiv.org&rft.au=Cherinka%2C+Brian&rft.au=Andrews%2C+Brett+H&rft.au=S%C3%A1nchez-Gallego%2C+Jos%C3%A9&rft.au=Brownstein%2C+Joel&rft.date=2018-12-06&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422&rft_id=info:doi/10.48550%2Farxiv.1812.03833 |