- Tokyo, Japan
-
08:02
(UTC +09:00) - https://scholar.google.com/citations?user=c-5kKf4AAAAJ
- https://orcid.org/0000-0002-2496-2699
-
-
presidio-research Public
Forked from microsoft/presidio-researchThis package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers …
Jupyter Notebook MIT License UpdatedMar 2, 2025 -
s4 Public
Forked from cataluna84/s4Structured state space sequence models - Codebase Update
Jupyter Notebook Apache License 2.0 UpdatedFeb 7, 2025 -
presidio Public
Forked from microsoft/presidioContext aware, pluggable and customizable data protection and de-identification SDK for text and images
Python MIT License UpdatedFeb 5, 2025 -
-
allennlp Public
Forked from allenai/allennlpA fork for adding AWD-BiLSTM-CRFs to AllenNLP.
Python Apache License 2.0 UpdatedMar 20, 2024 -
2019-nCoV Public
Forked from kiang/2019-nCoVi18n for https://kiang.github.io/2019-nCoV/
JavaScript MIT License UpdatedJul 23, 2023 -
bigscience-metadata Public
Forked from bigscience-workshop/metadataExperiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.
Python Apache License 2.0 UpdatedJun 12, 2023 -
reactphp-child-process-pool Public
Forked from WyriHaximus/reactphp-child-process-poolTry to support bidirectional communications between the pool and the child processes.
PHP MIT License UpdatedApr 17, 2023 -
fastai_dev Public
Forked from fastai/fastai_devdevelopment of the next version of fastai
Jupyter Notebook Apache License 2.0 UpdatedApr 11, 2023 -
airflow Public
Forked from apache/airflowApache Airflow - A platform to programmatically author, schedule, and monitor workflows
Python Apache License 2.0 UpdatedMar 6, 2023 -
bigscience-evaluation Public
Forked from bigscience-workshop/evaluationCode and Data for Evaluation WG
Python Other UpdatedFeb 16, 2023 -
bigscience-promptsource Public
Forked from bigscience-workshop/promptsourceToolkit for collecting and applying templates of prompting instances
Python Apache License 2.0 UpdatedAug 6, 2022 -
Megatron-DeepSpeed Public
Forked from bigscience-workshop/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedAug 1, 2022 -
docker-py Public
Forked from docker/docker-pyA Python library for the Docker Engine API
Python Apache License 2.0 UpdatedJun 15, 2022 -
datasets Public
Forked from huggingface/datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Python Apache License 2.0 UpdatedApr 7, 2022 -
flair Public
Forked from flairNLP/flairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Python Other UpdatedMar 15, 2022 -
REL Public
Forked from informagi/RELREL: Radboud Entity Linker
Python MIT License UpdatedMar 7, 2022 -
docker-airflow Public
Forked from puckel/docker-airflowDocker Apache Airflow
Shell Apache License 2.0 UpdatedAug 18, 2021 -
bigscience-tokenization Public
Forked from bigscience-workshop/tokenizationPython Apache License 2.0 UpdatedJul 22, 2021 -
Annotation_Tools Public
Forked from Machine-Learning-Tokyo/Annotation_ToolsOpen Source Annotation Tools for Computer Vision and NLP tasks
UpdatedApr 5, 2020 -
nlp_data_aug Public
Forked from the-asir/nlp_data_augA fork yet also an upstream for a research collaboration on text augmentation. This repository focuses on perturbations of surface patterns and optimizations with batch-size scaling.
Jupyter Notebook MIT License UpdatedNov 24, 2019 -
playground Public
A junkyard for a computational-linguist-wannabe.
Jupyter Notebook MIT License UpdatedNov 14, 2019 -
-
-
NLP-progress Public
Forked from sebastianruder/NLP-progressRepository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Python MIT License UpdatedFeb 23, 2019 -
stanfordnlp-srparser-ja Public
Forked from Kitter/stanfordnlp-srparser-jaStanford CoreNLP Shift-Reduce Parser for Japanese (日本語係り受け解析)
Java GNU General Public License v3.0 UpdatedOct 19, 2018 -
NTCIR-13 QA Lab-3 Essay Question-Answering System
-
crfsuite-openmp Public
Forked from chokkan/crfsuiteOpenMP fork of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)
-
pointer-generator Public
Forked from abisee/pointer-generatorCode for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
Python Other UpdatedMay 29, 2017