Commit 4609f97
change loss return format so that it can work with calculate_per_token_loss (NVIDIA-NeMo#12459)
* loss upscaling has been moved to MCore, so there is no need to handle it at the model level any more
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* return loss_sum and num_valid_tokens separately
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* change num_tokens dtype to int
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* fix a return type
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* clean masked_token_loss
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: xrennvidia <xrennvidia@users.noreply.github.com>
* minor fix
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: xrennvidia <xrennvidia@users.noreply.github.com>
* minor fix
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: xrennvidia <xrennvidia@users.noreply.github.com>
* bug fix
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* remove one unused import
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* fix pylint error
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
* Apply isort and black reformatting
Signed-off-by: xrennvidia <xrennvidia@users.noreply.github.com>
---------
Signed-off-by: Xiaowei Ren <xren@nvidia.com>
Signed-off-by: xrennvidia <xrennvidia@users.noreply.github.com>
Co-authored-by: xrennvidia <xrennvidia@users.noreply.github.com>
1 parent 2f08584 commit 4609f97
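The commit message describes returning `loss_sum` and `num_valid_tokens` separately so that the caller can normalize per token. The following is a minimal, framework-free sketch of why that return format matters. It is not the NeMo or MCore code, which is not shown here. The helper names `masked_loss_sum` and `per_token_loss` are hypothetical, and plain Python lists stand in for tensors.

```python
from typing import List, Sequence, Tuple


def masked_loss_sum(losses: Sequence[float], loss_mask: Sequence[int]) -> Tuple[float, int]:
    """Hypothetical helper: return (loss_sum, num_valid_tokens) instead of a mean."""
    loss_sum = sum(loss * mask for loss, mask in zip(losses, loss_mask))
    num_valid_tokens = int(sum(loss_mask))  # token count kept as an int
    return loss_sum, num_valid_tokens


def per_token_loss(microbatches: List[Tuple[float, int]]) -> float:
    """Hypothetical reducer: combine (loss_sum, num_valid_tokens) pairs into one average."""
    total_loss = sum(loss_sum for loss_sum, _ in microbatches)
    total_tokens = sum(num_tokens for _, num_tokens in microbatches)
    return total_loss / max(total_tokens, 1)  # guard against fully masked input


# Two microbatches with different numbers of unmasked tokens.
mb1 = masked_loss_sum([2.0, 4.0, 9.0], [1, 1, 0])
mb2 = masked_loss_sum([1.0, 1.0, 1.0, 1.0], [1, 1, 1, 1])

# Normalizing once over all valid tokens weights every token equally.
global_mean = per_token_loss([mb1, mb2])

# Averaging per-microbatch means instead over-weights the shorter microbatch.
mean_of_means = (mb1[0] / mb1[1] + mb2[0] / mb2[1]) / 2
```

In this toy case the two reductions disagree (10/6 versus 2.0). Returning only a pre-averaged loss would make the per-token result unrecoverable, whereas the sum and the count together allow either reduction downstream.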
2 files changed, 29 additions & 44 deletions in total
File 1 of 2 (file name not preserved): 9 additions & 4 deletions
Diff body not preserved; only the hunk ranges survive:
@@ -46,7 +46,6 @@ (1 deletion)
@@ -883,7 +882,7 @@ (1 deletion, 1 addition)
@@ -915,8 +914,14 @@ (2 deletions, 8 additions)
File 2 of 2 (file name not preserved): 20 additions & 40 deletions, counted from the diff markers
Diff body not preserved; only the hunk ranges survive:
@@ -1742,7 +1742,7 @@ (1 deletion, 1 addition)
@@ -1752,33 +1752,30 @@ (17 deletions, 14 additions)
@@ -1818,34 +1815,17 @@ (22 deletions, 5 additions)