Skip to content

Questions on performance #7

@sagpid

Description

@sagpid

Hi,

We download the code and were able to make it work both on a localhost deployment of cassandra and a remote deployment. Thanks a lot of the great piece of work that you have shared, and it has saved us a lot of time and effort.

Please find my questions below on performance.

  1. About 275 map jobs are started in hadoop when a simple select count(*) is issued on the hive. This slows down the query enormously if the query is issued on hive on a external table which is located on cassandra. ( about 30 minutes for 150 records)
  2. If I create hive table from external cassandra table it is very slow. ( About 30 minutes.

Is there a work around or something to be expected from hive side.

thanks

Sagar

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions