Hi, thank you for the valuable and interesting work in "M2D2: A Massively Multi-Domain Language Modeling Dataset"! We would like to use the M2D2 corpus in our current paper and wanted to ask whether the Medicine domain for S2ORC articles is available in M2D2. Figure 2 in the paper shows Medicine as one of the S2ORC domains, but the Medicine domain and its subdomains don't seem to be present in the later experiments, in the lists of S2ORC domains and subdomains, or on Hugging Face.