Velocity_Preserving_Simplification

This is a velocity preserving simplification by NCTU ADSL lab. It includes Adaptive Trajectory Simplification (ATS) and Non-Partition Adaptive Trajectory Simplification (NP-ATS), wihch is s streaming version of ATS

Required Environment

PyPy 5.1.1 as python runtime environment.

PostgreSQL 9.5.2

to link pgSQL correctly, you need modify dbname, user and passwd in get_trajectory.py, runThresholdDecisionModel.py and runExperiments.py to correct value

Dataset

we use a table trajectory.taxi in pgSQL

Column	Type	Comment
tid	bigint	unique trajectory id
index	bigint	index for each point in trajectory
lon	double precision	longitude value
lat	double precision	latitude value
timestamp	timestamp without time zone	recording time

and 2 indexes:

"taxi_tid_index_idx" btree (tid, index)
"taxi_lon_lat_idx" btree (lon, lat)

Python Model

Model Name	Comment
logging	output log information
psycopg2cffi	connect pgSQL
numpy	matrix calculation

Main Function

Our framework is as follows, which includes Threshold Decision Model and Adaptive Trajectory Simplification.

Threshold Decision Model

location: runThresholdDecisionModel.py

Given a trajectory tid file and parameters, output a list of error tolerances and velocity ranges.
1. partition trajectory by gini index to different sub-trajectory with consistent velocity
2. use adaptive score to find the suitable epsilon of each sub-trajectory
3. calculate the average value of epsilons in each velocity range

args:

name	type	comment
-f, --file	string	input file name in tid_list/
-g, --gini	float	partition threshold (0.1, 0.9)
-a, --alpha	float	weight of error and compression (0.1,0.9)
-h, --help		help information

Example:

$ pypy runThresholdDecisionModel.py -f example.txt -g 0.5 -a 0.5
Namespace(alpha=0.5, file='example.txt', gini=0.5)
INFO:root:retrieve dataset
use saved file: dataset/raw/example.txt
INFO:root:partition
INFO:root:threshold calculation
INFO:root:distribution analysis
groupID epsilon
0 17.5818491354 m
1 32.1597972196 m
2 66.5642479441 m
3 101.584648286 m
[0.0001583950372554716, 0.0002897279028794159, 0.0005996779094060433, 0.0009151770115898896]  // epsilon list

Adaptive Trajectory Simplification

location: simplication/ATS.py

function: ATS(trajectory, min_gini)

Given a raw traejctory and a threshold of gini index, retuen a simplified trajectory.
1. partition trajectory by gini index to different sub-trajectory with consistent velocity
2. simplify each sub-trajectory adaptively by suitable position-error tolerance of threshold mapping table
3. merge all simplified sib-trajectories to a complete simplified trajectory
    
INPUT
    trajectory: raw trajectory
        e.g.
        [
            {'tid': 0, 'index': 0, 'x': 10, 'y': 10.5},
            {'tid': 0, 'index': 1, 'x': 11, 'y': 10.3},
            {'tid': 0, 'index': 2, 'x': 15, 'y': 12.5}...
        ]
    min_gini: partition threshold
OUTPUT
    simplified trajectory idx list
    	e.g. [0, 1, 5, 9, 10]

Non-Partition Adaptive Trajectory Simplification

location: simplication/ATS.py

function: NP_ATS(trajectory, min_gini)

A streaming version of ATS
Given a raw traejctory and a threshold of gini index, retuen a simplified trajectory.
1. add each incomming point p to buffer
2. calculate the error of buffer and whether velocity changes
3. keep p if error > error tolerance from threshold mapping table or velocity changes
4. stop while adding the last point of trajectory to buffer
    
INPUT
    trajectory: raw trajectory
        e.g.
        [
            {'tid': 0, 'index': 0, 'x': 10, 'y': 10.5},
            {'tid': 0, 'index': 1, 'x': 11, 'y': 10.3},
            {'tid': 0, 'index': 2, 'x': 15, 'y': 12.5}...
        ]
    min_gini: partition threshold (useless)
OUTPUT
    simplified trajectory idx list
    	e.g. [0, 1, 5, 9, 10]

Experiments

test the comression rate, velocity error, and effectiveness for DTW, EDR, LCSS

read raw dataset from Database
simplification by different method
process top-k similar trajectories retrival on raw dataset
process top-k similar trajectories retrival on simplified dataset
calculate the compression rate, velocity error, and effectiveness

location: runExperiments.py

args:

name	type	comment
-f, --file	string	input file name in tid_list/
-g, --gini	float	partition threshold (0.1 - 0.9)
-e, --epsilon	float	matching threshold for edr and lcss
-k, --topk	int	number of topk similar trajectory retrieved
-l, --loop	int	number of loops to get teh average values
--task	string	teak name (dtw, edr, lcss)
-h, --help		help informaion

the simplified dataset will save in dataset/ directory
the output file will save im result/ directory

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
simplification		simplification
tid_list		tid_list
.gitignore		.gitignore
README.md		README.md
effectiveness.py		effectiveness.py
exp_framework.png		exp_framework.png
framework.png		framework.png
get_trajectory.py		get_trajectory.py
plot.py		plot.py
runExperiments.py		runExperiments.py
runThresholdDecisionModel.py		runThresholdDecisionModel.py
similarity.py		similarity.py
testSimplificatino.py		testSimplificatino.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Velocity_Preserving_Simplification

Required Environment

Dataset

Python Model

Main Function

Threshold Decision Model

Adaptive Trajectory Simplification

Non-Partition Adaptive Trajectory Simplification

Experiments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Velocity_Preserving_Simplification

Required Environment

Dataset

Python Model

Main Function

Threshold Decision Model

Adaptive Trajectory Simplification

Non-Partition Adaptive Trajectory Simplification

Experiments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages