Daniel Mietchen
On this page, old discussions are archived after 90 days. An overview of all archives can be found at this page's archive index. The current archive is located at 2024/08. |
Wikidata Eleventh Birthday - India Datathon
editGreetings Daniel Mietchen!
As part of Wikidata's 11th birthday celebration a 11 days long datathon is planned from October 26, 2023 00:00 to November 5, 2023, 23:59 (both times Indian Standard Time) to improve the India related data in Wikidata. The data-thon aims to enhance Wikidata items by incorporating labels and descriptions in your native language, along with the addition of references, qualifiers, and statements. Additionally, it encourages the utilization of India-related properties within these items, ultimately contributing to the improvement of structured data in Wikidata.
Please have a look at the event page and and join the datathon by adding your name in the participant's section at here.
You are receiving this message as you are one of the participants of WikiProject India on Wikidata. If you do not want to receive this kind of notification further, you can remove your username from here.
Regards,
- The above post does not have a date, which complicates archiving. But this one has, so the section should be picked up in due course. --Daniel Mietchen (talk) 11:58, 21 May 2024 (UTC)
Wikidata weekly summary #608
edit- Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here: What changes would you like to see in the newsletter in 2024?"
- Discussions
- Import sitelinks, labels, descriptions from ku wikipedia pages which use the template w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
- Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
- Events
- Upcoming: Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
- Ongoing: Weekly Lexeme Challenge #122: Rock-forming minerals
- Press, articles, blog posts, videos
- Blogs
- African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
- Papers: Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
- Videos
- Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
- Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
- Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
- No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
- Wiki(s)data #5: Wikidata Live editing (in Italian) --> The ontology of Wikidata: how to interact with it for a better quality, by Epìdosis
- Notebooks
- Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
- Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
- The Gender-Equality Gap in STEM Awards --> A network graph and multiple data visualizations on UCLA's alumnni awards based on gender.
- Exploring The Belichick Coaching Tree --> This analyses details the coaching tree of the prolific American Football coach Bill Belichick.
- State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
- An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
- Blogs
- Tool of the week
- Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
- Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
- Other Noteworthy Stuff =
- Job opening: Data Scientist / Knowledge Engineer to use Wikidata as a foundational layer for an US National Science Foundation (NSF) funded Prototype Open Knowledge Network.
- Did you know?
- Newest properties:
- General datatypes: none
- External identifiers: WHDLoad database ID, Shanghai Library movie ID, PCSX2 Wiki ID, KRS number, Twitch numeric channel ID, RPCS3 Wiki ID, Black Games Archive ID, Citra compatibility database ID, DraCor ID, ORBi article ID, IGN wiki article ID, AreWeAntiCheatYet ID, RPGFan game ID, Arcade Hub ID
- New property proposals to review:
- General datatypes:
- Laws of Malaysia URL (Uniform Resource Locator for laws of Malaysia)
- production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
- External identifiers: Schnittberichte.com ID, National Library of Malaysia OPAC ID, HistoriaGames series ID, Kemono Games game ID, Internet Game Database event ID, GamesMeter ID, Walk Score ID, Malaysia company new number, Am Faclair Beag ID, xemu compatibility database ID, Sofascore player ID, GameGear.jp ID, RPGWatch IDs, Team England ID, TORCH taxon ID, ScummVM ID, Abandonware France IDs
- General datatypes:
- Query examples:
- Newest WikiProjects: WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
- Newest database reports: children of dead mothers - List of mother-children pairs, where death date of parent < birth date of child
- Showcase Items: Esperanto (Q143) - international auxiliary language designed by L. L. Zamenhof
- Showcase Lexemes: L1222568 (বড়দিন) - Bengali noun for 'Christmas'
- Newest properties:
- Development
- Due to the winter holidays, the development team is taking a break and no deployment is happening for Wikidata at the moment.
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
- The above post does not have a date, which complicates archiving. But this one has, so the section should be picked up in due course. --Daniel Mietchen (talk) 11:58, 21 May 2024 (UTC)
Wikidata weekly summary #609
edit- Discussions
- Open request for adminship: WikiBayer (RfP scheduled to end after 8 January 2024 12:01 UTC)
- Closed request for adminship: EPIC (closed as successful). Welcome onboard \o/
- New requests for permissions/Bot: HVSH-Bot . Task: Import data about politicians from the Q119949776, now only partially online available.
- Events
- Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Ongoing: Weekly Lexeme Challenge #123: Ologist
- Press, articles, blog posts, videos
- Papers: Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
- Videos: WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
- Tool of the week
- WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
- Did you know?
- Newest properties:
- General datatypes: none
- External identifiers: Shamela book edition ID, HistoriaGames series ID, Schnittberichte.com title ID, Kemono Games game ID, Internet Game Database event ID, Xemu compatibility database ID, GamesMeter game ID, GameGear.jp ID, Walmart product ID, Swissubase person ID, RPGWatch game ID, RPGWatch company ID, RPGWatch press ID, Indie DB company ID, NIWA article ID, turismoroma.it place ID, ScummVM ID, ORBi author ID, Abandonware-France video game series ID, Abandonware-France video game compilation ID, Abandonware-France person ID, Abandonware-France company ID, Abandonware-France magazine ID, Abandonware-France award ID, Kanjipedia word ID, Moviefone movie ID, South African NPO number, Nigerian registered company ID, Abandonware-France video game ID, AFJV directory ID
- New property proposals to review:
- General datatypes:
- Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
- International Classification of Nonprofit Organizations ({{Q|2976602}} for {{Q|163740}} created by the {{Q|193727}} and adapted by the {{Q|1065}}.)
- creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
- television judge ()
- External identifiers: SERNEC taxon ID, Consortium of Bryophyte Herbaria taxon ID, Rhineland-Palatinate school ID, nebula channel id, Deutsche Bahn station number, ISzDb series ID, BG localisation unit ID, Cathopedia article ID, Native Plants Hawaii ID, Taiwan Biographical Database ID, Penstemon Database ID, Wikisage ID
- General datatypes:
- Query examples:
- Newest database reports: Merge candidates: Identical birth and death dates
- Showcase Items: Team Fortress 2 (Q382108) - team-based first-person shooter multiplayer video game.
- Showcase Lexemes: ورھا لگّݨ / ਵਰ੍ਹਾ ਲੱਗਣ (L907713) - Punjabi verb expressing the setting in of a new year.
- Newest properties:
- Development
- The development team is just returning from the winter holidays so there is no development update at the moment.
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
- Monthly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to a Showcase item.
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
- The above post does not have a date, which complicates archiving. But this one has, so the section should be picked up in due course. --Daniel Mietchen (talk) 11:58, 21 May 2024 (UTC)
Wikidata weekly summary #610
editDiscussions
- Closed request for adminship: WikiBayer (closed as successful). Welcome onboard \o/
- New requests for permissions/Bot: So9qBot 9. Task: Add DDO identifier to Danish lexemes.
- Upcoming:
- The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
- Wiki Mentor Africa (WMA) Hackathon, 19th to 21st January 2024
- Forschungsdatenmanagement: Wikidata as a collaborative information resource on research data management (German), takes place online, Wednesday 10th January 2024, 10-11am (CET).
Press, articles, blog posts, videos
- Blogs: PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
- Papers
- Linked data: un’opportunità per il riuso (Q124079430) "scientific article published in 2023" (paper in Italian) - deals with linked data in library catalogues, with many mentions of Wikidata.
- Automatically Constructed Indonesian Question Answering Dataset by Leveraging Wikidata by K. Doxolodeo & A.A. Krisnadhi - researchers have created a new Indonesian Question Answering dataset that is produced automatically end-to-end using Context Free Grammar, the Wikipedia Indonesian Corpus, and the concept of the proxy model
- LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
- Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
- Videos
- Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
- This video on Biodiversity Explorations with Machine Learning: Biodiversity Data Access Functions shows how Wikidata is being used to populate species entity profiles at Wolfram U, presented by Jofre Espigulé-Pons.
- Notebooks: Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project Software Collaboration for Wikidata prematurely. Read their joint statement here.
Newest properties and property proposals to review
- Newest General datatypes:
- Flora of the Hawaiian Islands URL (URL of the entry for a plant genus, species, subspecies, or variety in the Flora of the Hawaiian Islands website)
- (Montana Plant Life URL (URL for a plant family, genus, or species on the Montana Plant Life website)
- plate(s) (plate number(s) in the reference source being cited to support the statement being made)
- Newest External identifiers: Abandonware-France book ID, MilliBase taxon ID, Monasticon Hibernicum database ID, Rhineland-Palatinate school ID, Enciclopedia di Roma monument ID, Enciclopedia di Roma street ID, Mid-Atlantic Herbaria Consortium taxon ID, The Criterion Collection spine number
- New General datatypes property proposals to review:
- Water bottle volume (Volume of the water bottle)
- Is it metric? (To check if it's a metric.)
- Anti-Cheat software used (anti-cheat solution used by this multiplayer video game)
- New External identifier property proposals to review: turismo.marche.it place ID, Joseph Smith Papers person ID, Team Scotland ID, Globoplay ID, DoblajeVideojuegos game ID, National Natural Parks System ID, Commonwealth Games Australia ID, Adventure-Treff game ID, TouchArcade game ID, Mod.io ID, The Models Resource game ID, The Models Resource entity ID, tourist information point number, Jinji Koshinjyo ID, Bandcamp track ID
Did you know?
- Query examples:
- Newest WikiProject: Podcast Episodes 2024 - The goal of this project is to add episode pages for individual podcasts.
- Newest database report: children of unborn parents
- Showcase Item: Helsinki (Q1757) - capital and most populous city of Finland
- Showcase Lexeme: Allah korusun (L1226849) - Turkish for 'God forbid'
Development
- IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (phab:T351968)
- Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (phab:T305660)
- Wikibase REST API:
- We finished the work on making it possible to get all sitelinks of an Item (phab:T344041)
- We are working on getting a sitelink for a given wiki (phab:T344039)
You can see all open tickets related to Wikidata here. If you want to help, you can also have a look at the tasks needing a volunteer.
Weekly Tasks
- Add labels, in your own language(s), for the new properties listed above.
- Comment on property proposals: all open proposals
- Contribute to the showcase Item and Lexeme above.
- Participate in this week's Lexeme challenge: Ologies
- Help translate or proofread the interface and documentation pages, in your own language!
- Help merge identical items across Wikimedia projects.
- Help write the next summary!
- The above post does not have a date, which complicates archiving. But this one has, so the section should be picked up in due course. --Daniel Mietchen (talk) 11:58, 21 May 2024 (UTC)
Duplicates for Zoosystematica Rossica
editHi @Daniel Mietchen, I've noticed that User:Research Bot is creating duplicates of existing articles for Zoosystematica Rossica (Q18649566). I think there are 40-50 cases of articles prior to 2007 where the article already has an item in Wikidata (because I added it), but without a DOI. When I added those articles they didn't have one. Now many of them do, and ResearchBot has added some without first checking if they already exist(!).
I don't know how you check for duplicates, but I've learnt from experience that it's not enough to assume that if the DOI isn't in Wikidata that means that there is no item for that DOI. Any chance you can merge these newer items with the existing ones? I did a couple by hand but it's tedious. Rdmpage (talk) 12:57, 20 May 2024 (UTC)
- @Rdmpage: Thanks for the note — I'm on it. --Daniel Mietchen (talk) 19:14, 20 May 2024 (UTC)
- @Rdmpage: I'm done with the merges for now. Noticed a few things on the way: (i) the dates rarely agree — this probably needs some more cleanup; (ii) your Q107057509 was a duplicate of your Two new species of cave crickets of the genus Eremogryllodes (Orthoptera: Myrmecophilidae: Bothriophylacinae) from Iran (Q104465575), (iii) your Q116677031 was a duplicate of the bot's A description of Dendronotus shpataki sp. nov. (Gastropoda: Nudibranchia) from the Sea of Japan: a contribution of citizen science to marine zoology (Q114293958). A test merge via QuickStatements was incomplete, so I did the rest by hand. Do you have plans to fill in the missing DOIs? I normally work by topic rather than journal. --Daniel Mietchen (talk) 20:12, 20 May 2024 (UTC)
- @Daniel Mietchen Thanks for the merges, I did a couple that were missed. I've also gone through them and removed obvious duplicate bibliographic information, such as authors, and flagged CrossRef-sourced dates as preferred. I've left the main subject (P921) as is, figuring that was more your domain. I tend to work by journal as sadly, each journal typically requires its own special handling. Yes, I plan to add the missing DOIs for those articles that have them. Oh, and thanks for spotting the duplicates I'd made. Rdmpage (talk) 11:14, 21 May 2024 (UTC)
- @Rdmpage: I'm done with the merges for now. Noticed a few things on the way: (i) the dates rarely agree — this probably needs some more cleanup; (ii) your Q107057509 was a duplicate of your Two new species of cave crickets of the genus Eremogryllodes (Orthoptera: Myrmecophilidae: Bothriophylacinae) from Iran (Q104465575), (iii) your Q116677031 was a duplicate of the bot's A description of Dendronotus shpataki sp. nov. (Gastropoda: Nudibranchia) from the Sea of Japan: a contribution of citizen science to marine zoology (Q114293958). A test merge via QuickStatements was incomplete, so I did the rest by hand. Do you have plans to fill in the missing DOIs? I normally work by topic rather than journal. --Daniel Mietchen (talk) 20:12, 20 May 2024 (UTC)
Wikimedians for Sustainable Development - May 2024 Newsletter
edit- User group news
- Upcoming: User group meeting, 16 June
- Mini report from the Wikimedia Summit 2024
- User group representative interviewed by Wikipediapodden at Wikimedia Summit (commons)
- Minutes from user group meeting in May
- Other news
- Reflecting _Women For Sustainability Africa Arts + Feminism #Her Voice Campaign 2023
- Macedonia report: Climate change and GLAM (SDG 13)
- Biodiversity Heritage Library April monthly highlights (SDG 14 & 15)
- WikiProject Biodiversity featured in Nature Africa (SDG 14 & 15)
- Wikimedia UK releases a video about their climate focus (SDG 13)
- Events
- Wiki Loves Earth, the international photo contest of protected nature, continues in some countries. (SDG 14 & 15)
This message was sent with Global message delivery by Ainali (talk) 13:18, 1 June 2024 (UTC) • Contribute • Manage subscription
Wikimedians for Sustainable Development - June 2024 Newsletter
edit- User group news
- User group vote on the adoption of the Movement Charter (closes 7 July 23.59 UTC)
- Upcoming user group meeting 21 July
- User group meeting held in June - minutes
- The group was featured in the latest WikiAfrica Hour: #36: Does the Wikimedia movement contribute to the SDGs?
- Other news
- Stories from the anti-disinformation repository: How WikiProject COVID-19 and other Wikimedia initiatives counter health disinformation (SDG 3)
- Environment Centre Northern Territory Wikipedian in Residence (SDG 15)
- With AI can we increase transparency of companies' carbon footprints (in Swedish). Op-ed that mentions that the greenhouse gas emissions of the top 150 companies on the Stockholm stock exchange has been uploaded to Wikidata. The model is documented on WikiProject Climate Change on Wikidata. (SDG 13)
- Another Year in Review: Where is Wikimedia in the Climate Crisis? Seeing the impact of Wikimedia Projects (SDG 13)
- 46 scholars, self-advocates bring knowledge to Wikipedia’s disability healthcare content (SDG 3)
- Wikimedia Sverige publishes their 2023 climate impact report (in Swedish) (SDG 13)
- WikiProject Govdirectory has started weekly collaboration on countries (SDG 16)
- Events
- Wikimedia chapters and groups organise the first Sharks and Rays Wikimarathon (29 June, but edits in the weeks after are welcome) (SDG 14)
This message was sent with Global message delivery by Ainali (talk) 09:27, 1 July 2024 (UTC) • Contribute • Manage subscription
Wrong dates
editHello, these items that your bot claimed were published on "19 July 2019" actually are from the XX century!
I suppose this is due to a general misbehavior of the bot. Are you able to identify the magnitude of the problem? And could you fix it as soon as possible? Horcrux (talk) 18:25, 8 July 2024 (UTC)
- @Horcrux: Thanks for checking. The problem is described here, along with a code fix and some discussion about fixing the data. --Daniel Mietchen (talk) 23:12, 9 July 2024 (UTC)
- Yeah, I don't think I'll go into it further. Anyway, some of those items were duplicated of other ones that, curiously, were correctly dated by Research Bot in 2017 (see e.g. [1][2][3]), apart from the known problem of the date's precision ("1" should not be used as day of the month). --Horcrux (talk) 06:59, 10 July 2024 (UTC)
Could you implement large-scale import of studies data with the Research Bot?
editHi there, thanks for User:Research Bot which seems very useful and like it has imported data on a significant number of studies.
Could the bot be used to bulk-import data on very many studies – please give your input here and/or here. The latter is about Scholia and I think that platform becomes useful in the real world only once Wikidata has data on, basically, most studies.
For example, charts on scholia would be inaccurate if so many studies are missing in Wikidata. The former is also about books and names a few datasets that I think could be used, maybe most notably dumps by Anna's Archive. I also asked the operator of "LargeDatasetBot". I think you know a lot in this space and your input would be very appreciated!
Also <sub> does not work in item titles – for example see Q125340665. Maybe these could be replaced with the ₂ characters retrospectively and the title be altered accordingly for future imports. Prototyperspective (talk) 18:27, 17 July 2024 (UTC)
- @Prototyperspective: Thanks for thinking along. In terms of imports, the first question should always be whether we should do it, and in the affirmative case, we can explore whether and how we could implement any technical solutions. At the moment, anything large-scale is essentially precluded by the looming graph split anyway. In terms of markup in the titles, I think we should implement something, but technical solutions would probably depend as well on how the graph split plays out. --Daniel Mietchen (talk) 21:14, 21 July 2024 (UTC)
- Thank you, I did not know about this. moving the scholarly articles to a separate graph as in the experiment seems like a good idea and better enable what I asked about. Also I think there need to be some measures to prevent covert vandalism which could already be an issue with such a large number of largely unmonitored items, for example by locking all scholarly articles items to be only changed by bots/queries. The same would also be useful for food product items as mentioned in my recent comment here. I don't know which parts are missing when this bot and a few already imported many items and thought datasets like those used by OpenAlex or ScienceOpen could be used. Prototyperspective (talk) 23:08, 22 July 2024 (UTC)
Wikimedians for Sustainable Development - July 2024 Newsletter
edit- User group news
- User group meeting held in July, minutes
- Next user group meeting will be 18 August
- Other news
- Climate change editahon and workshop in Macedonia (SDG 13)
- WikiForHumanRights in Nigeria 2024 Campaign Virtual Launch (SDG 10&16)
- What we Learned from Wiki Women In Red @8 Campaign 2023 Women for Sustainability Africa (SDG 5)
- Ghanaian Wikipedians set to educate students on Open Climate (SDG 13)
- Using Wikipedia as a Tool for Climate Action (SDG 13)
- Events
- 5th August, m:Event:Wiki-Green_Conference_2024 Wiki-Green Conference (SDG 13)
- 7-10 August, Wikimania - All SDG related sessions
- 7-9 November, Justicia climática, voces indígenas y plataformas Wikimedia (SDG 13)
- Participate
- Share an example of a successful WikiProject or topical collaboration in this on-wiki survey
This message was sent with Global message delivery by Ainali (talk) 18:56, 1 August 2024 (UTC) • Contribute • Manage subscription