Skip to content

CONCAT ERROR(S) when running CN algorithm #345

@SarahGrevy

Description

@SarahGrevy

I have the right packages installed and the correct Python version, so why does this error come up when running the CN algorithm?

MERGE ERROR(S) AT: /burg/home/sgg2140/communitynotes/sourcecode/scoring/pflip_plus_model.py, in _compute_scoring_cutoff, at line 348: scoringCutoff = scoringCutoff.merge(cutoffByRatings[[c.noteIdKey, "ratingMin"]])
PandasTypeError: Type expectation mismatch on noteId: found=Int64 expected=int64
PandasTypeError: Input mismatch on noteId: left=int64 vs right=Int64 (UNALLOWED)
PandasTypeError: Merge key mismatch on noteId: left=int64 vs right=Int64 (UNALLOWED)
PandasTypeError: Output mismatch on noteId: result=int64 expected=None (UNALLOWED)

INFO:birdwatch.constants:Fitting pflip model elapsed time: 11.41 secs (0.19 mins)
Traceback (most recent call last):
File "/burg/home/sgg2140/communitynotes/sourcecode/main.py", line 33, in
main()
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/runner.py", line 269, in main
return _run_scorer(args=args, dataLoader=dataLoader, extraScoringArgs=extraScoringArgs)
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/pandas_utils.py", line 682, in _inner
retVal = main(*args, **kwargs)
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/runner.py", line 222, in _run_scorer
scoredNotes, helpfulnessScores, newStatus, auxNoteInfo = run_scoring(
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/run_scoring.py", line 1987, in run_scoring
) = run_prescoring(
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/run_scoring.py", line 1268, in run_prescoring
pflipPlusModel.fit(notes, ratings, noteStatusHistory, prescoringRaterModelOutput)
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/pflip_plus_model.py", line 1566, in fit
self._prepare_note_info(
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/pflip_plus_model.py", line 1054, in _prepare_note_info
scoringCutoff = self._compute_scoring_cutoff(
File "/burg/home/sgg2140/communitynotes/sourcecode/scoring/pflip_plus_model.py", line 349, in _compute_scoring_cutoff
assert len(scoringCutoff) == beforeMerge
AssertionError
srun: error: g276: task 0: Exited with exit code 1
ERROR: main.py failed

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions