Viewing 7 posts - 1 through 7 (of 7 total)
Hi Alan,
thanks for reading them. It would have been nice to see your longer answer.
Microsoft has(had?) a reseearch group on data cleaning https://www.microsoft.com/en-us/research/project/data-cleaning/
which resulted...
February 14, 2019 at 3:19 pm
Here is another one:
Lauri
November 22, 2018 at 1:47 am
Hi,
there has , in fact been some research on fast qgrams with SQL, see for instance
http://www.cs.columbia.edu/~gravano/Papers/2001/deb01.pdf
Lauri
November 22, 2018 at 1:37 am
Well, not really procedural, since all those "side tables" can be created using "vanilla" SQL. Your approach may run into problems if you are comparing thousands of values with each...
August 3, 2017 at 5:56 am
There has actually been some research on this approach, see for instance:
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.14.6750&rep=rep1&type=pdf
And Microsoft SSIS uses a similar approach in it's "fuzzy matching" component, see:
https://www.microsoft.com/en-us/research/project/data-cleaning/
August 3, 2017 at 5:18 am
Thank you for your kind words!
Indeed, there can be any number of reasons for overlapping indexes and it certainly would be irresponsible to just go and drop them...
June 26, 2014 at 2:18 pm
May I suggest an alternative script for the same purpose? It takes into consideration ascending/descending but gives false positives for "mirror image" indexes, e.g.
(c1 , c2 desc, c3) can be...
June 26, 2014 at 1:49 pm
Viewing 7 posts - 1 through 7 (of 7 total)