Menggy Technology


  • Have you had trouble extracting, cleansing, organizing, and formatting large volumes of data?
  • Do you have difficulty computing variables on huge datasets?
  • Have you struggled to extract specific content from a large number of PDFs, or to manage Excel sheets full of messy formulas and filters?
  • For academic researchers: have you ever struggled to implement highly complicated functions yourself for a psychology or marketing survey in Qualtrics, only to find that they don't work?
Focus on your research and leave this work to us. We are PROFESSIONAL.

Web Crawling and Scraping


Completed: 23,859,690 web pages, 75,402,055 records, 17.33 GB
www.imdb.com
www.crowdspring.com
www.itjuzi.com
www.tripadvisor.com
bbs.sgcn.com
www.xueqiu.com
bbs.hupu.com
bbs.taobao.com
www.cuaa.net
www.digikey.com
www.douban.com
www.ebay.com
www.playdota.com
www.propertyguru.com.sg
www.topuniversities.com
beijing.anjuke.com
dzh.mop.com
www.51baomu.cn
www.bloomberg.com
www.ccug.net
www.data.gov.sg
www.github.com
www.google.com
www.hclips.com
www.iteye.com
www.iyp.com.tw
www.laoyaoba.com
www.nuomi.com
www.sse.com.cn
www.szse.cn
www.xinshipu.com
Results can be delivered in VARIOUS encodings and formats, including customized ones.

UTF-8

GB2312

TXT

CSV

EXCEL

SQL

JSON
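
As a rough illustration of what delivering the same results in different encodings and formats can look like, here is a minimal Python sketch. The sample records and the helper names `to_csv_bytes` and `to_json_bytes` are hypothetical and are not taken from any actual delivery pipeline.

```python
import csv
import io
import json

# Hypothetical sample records; real results would come from a crawl or dataset.
records = [
    {"site": "www.douban.com", "title": "电影"},
    {"site": "www.imdb.com", "title": "Movies"},
]

def to_csv_bytes(rows, encoding):
    """Serialize rows to CSV text, then encode with the requested encoding."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue().encode(encoding)

def to_json_bytes(rows, encoding="utf-8"):
    """Serialize rows to JSON, keeping non-ASCII characters readable."""
    return json.dumps(rows, ensure_ascii=False, indent=2).encode(encoding)

csv_utf8 = to_csv_bytes(records, "utf-8")
csv_gb2312 = to_csv_bytes(records, "gb2312")
json_utf8 = to_json_bytes(records)
```

The content is identical in each case; only the byte representation changes, so the same helper can serve both UTF-8 and GB2312 requests.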

Data Processing


We can process MOST data sources.

PDF
EXCEL
HTML
JSON
TXT
CSV
SQL

We can help you process enormous files or huge datasets.

  • Extraction: Efficiently and effectively extract useful content from text files or datasets.
  • Cleansing: Clean datasets and filter out dirty data points.
  • Organization: Remove data redundancy and reduce data dependencies based on the entity-relationship (ER) model.
  • Formatting: Standardize data into a consistent format so that it can be used effectively.
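
The cleaning and standardizing steps listed above could be sketched roughly as follows. This is a simplified Python illustration only: the sample rows, the accepted date formats in `DATE_FORMATS`, and the helper names `parse_date` and `clean` are all assumptions made for the example.

```python
from datetime import datetime

# Hypothetical raw rows: stray whitespace, a duplicate, a missing value,
# and mixed date formats.
raw_rows = [
    {"name": "  Alice ", "joined": "2015-03-01"},
    {"name": "Alice", "joined": "01/03/2015"},
    {"name": "Bob", "joined": ""},
    {"name": "Carol", "joined": "2016/07/15"},
]

# Assumed input date formats; a real project would agree on these up front.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%Y/%m/%d")

def parse_date(text):
    """Return an ISO 8601 date string, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None

def clean(rows):
    """Trim fields, drop rows with unusable values, and remove duplicates."""
    seen = set()
    cleaned = []
    for row in rows:
        name = row["name"].strip()
        joined = parse_date(row["joined"])
        if not name or joined is None:
            continue  # filter out incomplete data points
        key = (name, joined)
        if key in seen:
            continue  # remove redundant records
        seen.add(key)
        cleaned.append({"name": name, "joined": joined})
    return cleaned

cleaned_rows = clean(raw_rows)
```

Here the two "Alice" rows collapse into one once their dates are standardized, and the row with the empty date is dropped.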

Data Computing


  • Basic Stats: Sum, Mean (Average), Median, Standard Deviation, Variance.
  • Network: In/out Degree, Size, Closeness Centrality, Reach Centrality, Shortest Path.
  • Datetime: Time Duration, Days of the Week, Number of Days.
  • Customization: Compute data according to your requests.
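
To give a feel for a few of the computations listed above, here is a small Python sketch using only the standard library. The sample values, the toy directed graph, and the helper name `shortest_path_length` are hypothetical, and population (not sample) variance is an assumption made for the example.

```python
import statistics
from collections import deque
from datetime import date

# Basic stats on a hypothetical sample.
values = [2, 4, 4, 4, 5, 5, 7, 9]
total = sum(values)
mean = statistics.mean(values)
median = statistics.median(values)
variance = statistics.pvariance(values)  # population variance
std_dev = statistics.pstdev(values)      # population standard deviation

# Network metrics on a toy directed graph stored as an adjacency list.
graph = {"A": ["B", "C"], "B": ["C"], "C": ["D"], "D": []}
out_degree = {node: len(targets) for node, targets in graph.items()}
in_degree = {node: 0 for node in graph}
for targets in graph.values():
    for target in targets:
        in_degree[target] += 1

def shortest_path_length(graph, source, target):
    """Breadth-first search; returns the hop count, or None if unreachable."""
    queue = deque([(source, 0)])
    visited = {source}
    while queue:
        node, dist = queue.popleft()
        if node == target:
            return dist
        for neighbor in graph[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append((neighbor, dist + 1))
    return None

# Datetime: number of days between two dates and the day of the week.
DAY_NAMES = ("Monday", "Tuesday", "Wednesday", "Thursday",
             "Friday", "Saturday", "Sunday")
start, end = date(2015, 3, 1), date(2015, 3, 15)
number_of_days = (end - start).days
day_of_week = DAY_NAMES[start.weekday()]
```

Breadth-first search is enough for unweighted graphs like this one; weighted networks would call for a different shortest-path method.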

Research & Web Development


  • Text Analysis: Word Counting, EN/CN Word Segmentation, Stopword Removal, Punctuation Stripping.
  • Advanced: TF-IDF Vectorization, Sentiment Analysis, POS Tagging, Data Classification/Clustering.
  • Regression Analysis: Linear, Logistic, Polynomial, Stepwise, Ridge, Lasso, ElasticNet.
  • Research-level Web Development: Websites for marketing surveys or psychology studies that require very complicated functions, such as condition randomization and time tracking (in milliseconds), or that need to work with other devices, such as skin conductance, heart rate (BPM), and EEG sensors.
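
As one example of the text analysis steps listed above, the following Python sketch strips punctuation, removes stopwords, counts words, and builds a simple TF-IDF vector. The mini-corpus, the stopword list, and the helper names `tokenize` and `tf_idf` are hypothetical. It assumes English text split on whitespace; Chinese text would need a dedicated word segmenter instead. The TF-IDF weighting shown (raw term frequency times log of inverse document frequency) is only one of several common variants.

```python
import math
import string
from collections import Counter

# Hypothetical mini-corpus and stopword list, for illustration only.
documents = [
    "The cat sat on the mat.",
    "The dog chased the cat!",
    "Dogs and cats are pets.",
]
STOPWORDS = {"the", "on", "and", "are"}

def tokenize(text):
    """Lowercase, strip ASCII punctuation, split on whitespace, drop stopwords."""
    stripped = text.lower().translate(str.maketrans("", "", string.punctuation))
    return [word for word in stripped.split() if word not in STOPWORDS]

tokenized = [tokenize(doc) for doc in documents]

# Word counting across the whole corpus.
word_counts = Counter(word for tokens in tokenized for word in tokens)

def tf_idf(tokens, corpus):
    """TF-IDF with tf = count / document length and idf = log(N / doc frequency)."""
    counts = Counter(tokens)
    n_docs = len(corpus)
    vector = {}
    for term, count in counts.items():
        doc_freq = sum(1 for doc in corpus if term in doc)
        vector[term] = (count / len(tokens)) * math.log(n_docs / doc_freq)
    return vector

vectors = [tf_idf(tokens, tokenized) for tokens in tokenized]
```

Terms that appear in fewer documents, such as "sat", receive a higher weight than terms shared across documents, such as "cat".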

Our Clients


Nanyang Business School
NUS Business School