Edits to the knowledge docs

mcorbin-ibm · jjasghar · juliadenham · mcorbin-ibm · commit 0330db7f7f3d · 2024-09-03T16:11:02.000-04:00
- edited 3 knowledge docs files
- removed broken links and list of knowledge domains

Signed-off-by: Michelle Corbin <corbinm@us.ibm.com>
Co-Authored-By: JJ Asghar <awesome@ibm.com>
Co-Authored-By: Julia Denham <jdenham@redhat.com>
diff --git a/docs/taxonomy/knowledge/contribution_details.md b/docs/taxonomy/knowledge/contribution_details.md
@@ -4,22 +4,22 @@ description: The overview of 🐶 InstructLab's Knowledge contribution guideline
 logo: images/ilab_dog.png
 ---
 
-You can create a Git repository to host your knowledge contributions anywhere (GitLab, Gerrit, etc.) but it may be favorable to create one on GitHub. The following instructions show you how to create a knowledge repository in GitHub and contribute to the taxonomy.
+You can create a Git repository to host your knowledge contributions anywhere (GitLab, Gerrit, etc.) but it might be favorable to create one on GitHub. The following instructions show you how to create a knowledge repository in GitHub and contribute to the taxonomy.
 
 ## Prerequisites
 
 - You have a GitHub account
 - You have a forked copy of the [taxonomy](https://github.com/instructlab/taxonomy/tree/main) repository
-- Verify that the model does not already know the knowledge you want to submit
+- You have verified that the model does not already know the knowledge you want to submit
 
 ## Creating your own knowledge repository
 
 To create a new GitHub repository, follow the GitHub documentation in [Creating a new repository](https://docs.github.com/en/repositories/creating-and-managing-repositories/creating-a-new-repository).
 
 The specific steps are listed as follows:
 
-1. In your GitHub profile page, navigate to the repositories tab. You will see a search bar where you can search your repositories, or create a new one.
-2. This takes you to a page titled “Create a new repository”. Create a custom name for your repository and add a README.md file. For example, “knowlege_contributions” could be a good name for your repository.
+1. In your GitHub profile page, navigate to the repositories tab. You will see a search bar where you can search your repositories or create a new one.
+2. This takes you to a page titled “Create a new repository”. Create a custom name for your repository and add a `README.md` file. For example, “knowlege_contributions” could be a good name for your repository.
 3. Click “Create” when you are all set.
 
 ## Convert your knowledge documentation to markdown
@@ -40,15 +40,15 @@ The specific steps are listed as follows:
 3. You can then see your new content in your repository.
 
 !!! important
-    Make a note of your commit SHA; you need it for your `qna.yaml`.
+    Make a note of your commit SHA; you'll need it for your `qna.yaml`.
 
 ## Create a pull request in the taxonomy repository
 
 Navigate to your forked taxonomy repository and ensure it is up-to-date.
 
 There are a few ways you can create a pull request:
 
-- For details on the local process, check out [The GitHub Workflow Guide](https://github.com/kubernetes/community/blob/master/contributors/guide/github-workflow.md) in the kubernetes documentation and the [GitHub flow](https://docs.github.com/en/get-started/using-github/github-flow) in the GitHub documentation.
+- For details on the local process, check out [The GitHub Workflow Guide](https://github.com/kubernetes/community/blob/master/contributors/guide/github-workflow.md) in the Kubernetes documentation and the [GitHub flow](https://docs.github.com/en/get-started/using-github/github-flow) in the GitHub documentation.
 - For details on contributing using the GitHub webpage UI, see [Contributing using the GH UI](https://github.com/instructlab/taxonomy/docs/contributing_via_GH_UI.md) or [Creating a pull request](https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-a-pull-request?tool=webui) in the GitHub documentation.
 
 ## Verification
@@ -61,7 +61,7 @@ Here are a few things to check before seeking reviews for your contribution:
 
 ## PR Upstream Workflow
 
-The following table outlines the expected timing for the PR(s) you have put in. The PRs go through a few steps, and checks, but you should be able to map your `label` to
+The following table outlines the expected timing for the PRs you have submitted. The PRs go through a few steps, and checks, but you should be able to map your `label` to
 the place that it is in.
 
 | Label | Actor | Action | Duration |
diff --git a/docs/taxonomy/knowledge/guide.md b/docs/taxonomy/knowledge/guide.md
@@ -1,6 +1,6 @@
 ---
 title: Knowledge Guide
-description: The overview of 🐶 InstructLab's knowledge
+description: An overview of 🐶 InstructLab's knowledge
 logo: images/ilab_dog.png
 ---
 # What is "Knowledge"?
@@ -11,7 +11,7 @@ Knowledge contributions in this project contain a few things.
 
 - A file in a git repository that holds your information. For example, these repositories can include markdown versions of information on: Oscar 2024 winners, Law books, Shakespeare, Sports, Chemistry, etc.
 - A `qna.yaml` file that asks and answers questions about the information in the git repository.
-- A `attribution.txt` that includes the sources for the information used in the `qna.yaml`.
+- An `attribution.txt` file that includes the sources for the information used in the `qna.yaml`.
 
 You can learn more about the knowledge structure in [Getting Started with Knowledge contributions](https://github.com/instructlab/taxonomy/blob/main/README.md#getting-started-with-knowledge-contributions).
 
@@ -20,97 +20,39 @@ You can learn more about the knowledge structure in [Getting Started with Knowle
 !!! important
     We are currently only accepting knowledge contributions as a limited private beta and sources will be limited to articles from Wikipedia.
 
-There are a few domains of knowledge that we are currently accepting. For a full list of knowledge fields, see [Knowledge domains](https://github.com/instructlab/taxonomy/blob/main/knowledge/knowledge_domains.md) in the taxonomy documentation
-
-A few examples are as follows:
-
-### STEM fields
-
-- Physics
-  - Astronomy and Astrophysics
-  - Quantum Mechanics
-  - Special Relativity and General Relativity
-
-- Chemistry & Chemical Engineering
-  - Organic Chemistry
-  - Inorganic Chemistry
-  - Chemical engineering
-  - Biotechnology
-
-- Earth & Environmental Science
-  - Geology
-  - Geography
-
-- Biology & Life Sciences
-  - Plants (Botany)
-  - Medicine & health
-
-- Electrical Engineering
-- Bioengineering
-- Civil Engineering
-- Industrial Engineering
-
-### Legal and regulatory
-
-- Intellectual Property
-- Criminal Law
-- Civil Rights
-- Healthcare compliance
-
-### Economy and Business
-
-- Economy and Businesses
-- Accounting and Finance
-- Marketing
-- Human Resource
-- Management
-
-### Philosophy
-
-- Philosophy
-- Metaphysics
-- Epistemology
-- Ethics
-- Parapsychology & occultism
-- Philosophical schools of thought
-
-### Literature
-
-- Literature, rhetoric & criticism
-- American literature in English
-- Other literatures
+These are the main knowledge domains that we are currently accepting knowledge contributions for:  arts, engineering, geography, history, linguistics, mathematics, philosophy, religion, science, and technology.
 
 ## Avoid These Topics
 
-While the tuning process may eventually benefit from being used to help the models work with complex social topics, at this time this is an area of active research we do not want to take lightly. Therefore please keep your submissions clear of the following topics:
+While the tuning process may eventually benefit from being used to help the models work with complex social topics, at this time this is an area of active research we do not want to take lightly. Therefore, please keep your submissions clear of the following topics:
 
 - PII (personally identifiable information) or any content invasive of individual privacy rights
-- Violence including self-harm
-- Cyber Bullying
-- Internal documentation or other that is confidential to your employer or organization, e.g. trade secrets
+- Violence, including self-harm
+- Cyber bullying
+- Internal documentation or other information that is confidential to your employer or organization, such as trade secrets
 - Discrimination
 - Religion
-  - Facts such as, "[Christianity is, according to the 2011 census, the fifth most practiced religion in Nepal, with 375,699 adherents, or 1.4% of the population](https://en.wikipedia.org/wiki/Christianity_in_Nepal)", are fine as a knowledge contribution. Advocating in favor of or against any religious faith is not acceptable.
+  - Facts such as, "[Christianity is, according to the 2011 census, the fifth most practiced religion in Nepal, with 375,699 adherents, or 1.4% of the population](https://en.wikipedia.org/wiki/Christianity_in_Nepal)", are fine as a knowledge contribution. However, advocating in favor of or against any religious faith is not acceptable.
 - Medical or health information
-  - Facts such as,  "[In mammals, pulmonary ventilation occurs via inhalation (breathing)](https://opentextbc.ca/biology/chapter/11-3-circulatory-and-respiratory-systems/)," are fine as a knowledge contribution. Tailored medical/health advice is not acceptable.
+  - Facts such as,  "[In mammals, pulmonary ventilation occurs via inhalation (breathing)](https://opentextbc.ca/biology/chapter/11-3-circulatory-and-respiratory-systems/)," are fine as a knowledge contribution. However, tailored medical/health advice is not acceptable.
 - Financial information
-  - Facts such as "[laissez-faire economics ... argues that market forces alone should drive the economy and that governments should refrain from direct intervention in or moderation of the economic system](https://openstax.org/books/world-history-volume-2/pages/6-3-capitalism-and-the-first-industrial-revolution)," are fine as a knowledge contribution. Tailored financial advice is not acceptable.
-- Legal settlements/mitigations
-- Gender Bias
-- Hostile Language, threats, slurs, derogatory or insensitive jokes or comments
+  - Facts such as "[laissez-faire economics ... argues that market forces alone should drive the economy and that governments should refrain from direct intervention in or moderation of the economic system](https://openstax.org/books/world-history-volume-2/pages/6-3-capitalism-and-the-first-industrial-revolution)," are fine as a knowledge contribution. However, tailored financial advice is not acceptable.
+- Legal settlements or mitigations
+- Gender bias
+- Hostile language, threats, slurs, and derogatory or insensitive jokes or comments
 - Profanity
 - Pornography and sexually explicit or suggestive content
-- Any contributions that would allow for automated decision making that affect an individual's rights or well-being, e.g. social scoring
+- Any contributions that would allow for automated decision making that affect an individual's rights or well-being, such as social scoring
 - Any contributions that engage in political campaigning or lobbying
 
 We are also not accepting submissions of the following content:
 
 - Code
-  - Anything code-related that can be traced back to code for a computer. Not limited to `sed` or `bash` but `yaml`s for OpenShift or Kubernetes, to `python` snippets to `Java` suggestions. There are specific models focused on this space and this isn't for this model for the time being.
+  - Anything code-related that can be traced back to code for a computer. Not limited to `sed` or `bash` or `yaml`s for OpenShift or Kubernetes, to `python` snippets to `Java` suggestions. There are specific models focused on this space and this isn't for this model for the time being.
 - Jokes
 - Poems
 
-We received many joke and poem submissions at the beginning of the project, and with jokes being "in the eye of the beholder" and puns requiring nuance for native English speakers, we realized we were possibly unconsciously biasing our model. We have discovered that working with both topics has its own challenges, and if we want something generalized, finding consensus was unsuccessful. For now, we're not accepting additional submissions of jokes and poems.
+We received many joke and poem submissions at the beginning of the project, and with jokes and poems being "in the eye of the beholder" and puns requiring nuance for native English speakers, we realized we were possibly unconsciously biasing our model. We have discovered that working with both topics has its own challenges, and if we want something generalized, finding consensus was unsuccessful. For now, we're not accepting additional submissions of jokes and poems.
 
 ## Building Your LLM Intuition
 
@@ -130,28 +72,34 @@ With a few of these qna's, the model will learn the periodic table because it ha
 
 ### LLMs are great at
 
-For these, however, it's common for LLMs to already have excellent performance. Try 3-5 examples in `lab chat` to confirm a deficit in the model before you build your submission, and share the examples in your Pull Request (PR).
+LLMs are great at these:
 
 - Brainstorming
 - Creativity
 - Connecting information
 - Cross-lingual behavior
 
+For these, however, it's common for LLMs to already have excellent performance. Try 3-5 examples in `lab chat` to confirm a deficit in the model before you build your submission, and then share the examples in your Pull Request (PR).
+
 ### LLMs need help with
 
-LLM behavior in these sorts of topics are very difficult for the model to get right. Try several examples to understand the nuances of the model's ability to do these sorts of tasks, and consider using corrections to the results you get in your tuning process.
+LLMs need help with these:
 
 - Chains of reasoning
 - Analysis
 - Story plots
 - Reassembling information
 - Effective and succinct summaries
 
+LLM behavior in these sorts of topics are very difficult for the model to get right. Try several examples to understand the nuances of the model's ability to do these sorts of tasks, and then consider using corrections to the results you get in your tuning process.
+
 ### LLMs are not so great at
 
-LLMs may struggle with solving math and computation. That said, improving some of these foundational skills may be something this work tackles in the future, but not at this time.
+LLMs are not so great at these:
 
 - Math
 - Computation
 - "Turing-complete" type tasks
 - Generating only true real-world information (they're prone to hallucinations)
+
+ LLMs may struggle with solving math and computation problems. That said, improving some of these foundational skills may be something this work tackles in the future, but not at this time.
diff --git a/docs/taxonomy/knowledge/index.md b/docs/taxonomy/knowledge/index.md