And maybe it makes sense to tune the garbage collection a bit...

Sumo 17.01.2021 10:03 / 17.01.2021 10:14

We could enable incremental collection; that usually improves performance.
/*
 * Note that this defines a large number of tuning hooks, which can
 * safely be ignored in nearly all cases.  For normal use it suffices
 * to call only GC_MALLOC and perhaps GC_REALLOC.
 * For better performance, also look at GC_MALLOC_ATOMIC, and
 * GC_enable_incremental.  If you need an action to be performed
 * immediately before an object is collected, look at GC_register_finalizer.
 * If you are using Solaris threads, look at the end of this file.
 * Everything else is best ignored unless you encounter performance
 * problems.
 */

/* Enable incremental/generational collection.  Not advisable unless    */
/* dirty bits are available or most heap objects are pointer-free       */
/* (atomic) or immutable.  Don't use in leak finding mode.  Ignored if  */
/* GC_dont_gc is non-zero.  Only the generational piece of this is      */
/* functional if GC_parallel is non-zero or if GC_time_limit is         */
/* GC_TIME_UNLIMITED.  Causes thread-local variant of GC_gcj_malloc()   */
/* to revert to locked allocation.  Must be called before any such      */
/* GC_gcj_malloc() calls.  For best performance, should be called as    */
/* early as possible.  On some platforms, calling it later may have     */
/* adverse effects.                                                     */
/* Safe to call before GC_INIT().  Includes a  GC_init() call.          */
GC_API void GC_CALL GC_enable_incremental(void);
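For reference, roughly how I'd wire it up (a minimal untested sketch based on the header comment above; the only bdwgc calls assumed here are GC_enable_incremental, GC_INIT, GC_MALLOC, GC_MALLOC_ATOMIC and GC_get_heap_size, all declared in gc.h):
#include <stdio.h>
#include <gc.h>

int main(void)
{
    /* Enable incremental/generational collection as early as possible.     */
    /* Per the header comment it is safe to call before GC_INIT() and       */
    /* already includes a GC_init() call.                                    */
    GC_enable_incremental();
    GC_INIT();

    for (int i = 0; i < 1000000; ++i) {
        /* Ordinary collectable allocation.                                  */
        int **p = GC_MALLOC(sizeof *p);
        /* Pointer-free payload: cheaper for the marker to scan.             */
        *p = GC_MALLOC_ATOMIC(64 * sizeof(int));
    }
    printf("heap size: %lu\n", (unsigned long)GC_get_heap_size());
    return 0;
}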
Or we could even enable parallel marking, but that already feels like overkill, and the status of this feature in the library is not entirely clear:
#ifdef GC_THREADS
  GC_API GC_ATTR_DEPRECATED int GC_parallel;
                        /* GC is parallelized for performance on        */
                        /* multiprocessors.  Currently set only         */
                        /* implicitly if collector is built with        */
                        /* PARALLEL_MARK defined and if either:         */
                        /*  Env variable GC_NPROC is set to > 1, or     */
                        /*  GC_NPROC is not set and this is an MP.      */
                        /* If GC_parallel is on (non-zero), incremental */
                        /* collection is only partially functional,     */
                        /* and may not be desirable.  The getter does   */
                        /* not use or need synchronization (i.e.        */
                        /* acquiring the GC lock).  Starting from       */
                        /* GC v7.3, GC_parallel value is equal to the   */
                        /* number of marker threads minus one (i.e.     */
                        /* number of existing parallel marker threads   */
                        /* excluding the initiating one).               */
  GC_API int GC_CALL GC_get_parallel(void);
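So at runtime we can at least check whether parallel marking actually kicked in, something like this (a sketch; GC_get_parallel is the getter declared right above, the rest is plain stdio):
#define GC_THREADS   /* must be defined before including gc.h in a threaded build */
#include <stdio.h>
#include <gc.h>

int main(void)
{
    GC_INIT();
    /* Per the comment above: since GC v7.3 this is the number of parallel   */
    /* marker threads excluding the initiating one; 0 means no parallel mark. */
    int extra_markers = GC_get_parallel();
    if (extra_markers > 0)
        printf("parallel marking: %d extra marker thread(s)\n", extra_markers);
    else
        printf("parallel marking is off\n");
    return 0;
}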
The docs say the following:
## Performance

We conducted some simple experiments with a version of
[our GC benchmark](http://www.hboehm.info/gc/gc_bench/) that was slightly
modified to run multiple concurrent client threads in the same address space.
Each client thread does the same work as the original benchmark, but they
share a heap. This benchmark involves very little work outside of memory
allocation. This was run with GC 6.0alpha3 on a dual processor Pentium III/500
machine under Linux 2.2.12.

Running with a thread-unsafe collector, the benchmark ran in 9 seconds. With
the simple thread-safe collector, built with `-DGC_THREADS`, the execution
time increased to 10.3 seconds, or 23.5 elapsed seconds with two clients. (The
times for the `malloc`/`free` version with glibc `malloc` are 10.51 (standard
library, pthreads not linked), 20.90 (one thread, pthreads linked), and 24.55
seconds respectively. The benchmark favors a garbage collector, since most
objects are small.)
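As I understand the setup, each client thread looks roughly like this (a schematic sketch, not the benchmark code itself; it assumes GC_THREADS is defined before including gc.h so that, at least on Linux builds with thread support, pthread_create is redirected to the GC-aware wrapper):
#define GC_THREADS              /* redirects pthread_create & co. via gc.h */
#include <pthread.h>
#include <stdio.h>
#include <gc.h>

#define N_ALLOCS 1000000

/* Each client thread just allocates from the shared GC heap, */
/* mimicking the benchmark described above.                    */
static void *client(void *arg)
{
    (void)arg;
    for (int i = 0; i < N_ALLOCS; ++i) {
        void **node = GC_MALLOC(2 * sizeof(void *));
        node[0] = GC_MALLOC_ATOMIC(32);   /* pointer-free payload */
    }
    return NULL;
}

int main(void)
{
    GC_INIT();
    pthread_t t1, t2;
    pthread_create(&t1, NULL, client, NULL);
    pthread_create(&t2, NULL, client, NULL);
    pthread_join(t1, NULL);
    pthread_join(t2, NULL);
    printf("collections: %lu\n", (unsigned long)GC_get_gc_no());
    return 0;
}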

The following table gives execution times for the collector built with
parallel marking and thread-local allocation support
(`-DGC_THREADS -DPARALLEL_MARK -DTHREAD_LOCAL_ALLOC`). We tested the client
using either one or two marker threads, and running one or two client threads.
Note that the client uses thread local allocation exclusively. With
`-DTHREAD_LOCAL_ALLOC` the collector switches to a locking strategy that
is better tuned to less frequent lock acquisition. The standard allocation
primitives thus perform slightly worse than without `-DTHREAD_LOCAL_ALLOC`,
and should be avoided in time-critical code.

(The results using `pthread_mutex_lock` directly for allocation locking would
have been worse still, at least for older versions of linuxthreads. With
`-DTHREAD_LOCAL_ALLOC`, we first repeatedly try to acquire the lock with
`pthread_mutex_trylock`, busy-waiting between attempts. After a fixed number
of attempts, we use `pthread_mutex_lock`.)
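Just to illustrate the strategy they describe, here is my own schematic version of the "try a few times, then block" pattern (not the collector's actual code; SPIN_ATTEMPTS is an arbitrary number I picked):
#include <pthread.h>
#include <sched.h>

#define SPIN_ATTEMPTS 10   /* arbitrary; the real limit in the collector may differ */

/* Spin-then-block acquisition of an allocation lock, as described above. */
static void alloc_lock_acquire(pthread_mutex_t *lock)
{
    for (int i = 0; i < SPIN_ATTEMPTS; ++i) {
        if (pthread_mutex_trylock(lock) == 0)
            return;                 /* got the lock without blocking */
        sched_yield();              /* back off briefly between attempts */
    }
    pthread_mutex_lock(lock);       /* fall back to a blocking acquire */
}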

These measurements do not use incremental collection, nor was prefetching
enabled in the marker. We used the C version of the benchmark. All
measurements are in elapsed seconds on an unloaded machine.

Number of threads | 1 marker thread (secs.) | 2 marker threads (secs.)
---|---|---
1 client | 10.45 | 7.85
2 clients | 19.95 | 12.3

The execution time for the single-threaded case is slightly worse than with
simple locking. However, the single-threaded benchmark runs faster than even
the thread-unsafe version if a second processor is available. The execution
time for two clients with thread-local allocation is only 1.4 times the
sequential execution time for a single thread in a thread-unsafe
environment, even though it involves twice the client work. That represents
close to a factor of 2 improvement over the 2 client case with the old
collector. The old collector clearly still suffered from some contention
overhead, in spite of the fact that the locking scheme had been fairly well
tuned.

Full linear speedup (i.e. the same execution time for 1 client on one
processor as 2 clients on 2 processors) is probably not achievable on this
kind of hardware even with such a small number of processors, since the memory
system is a major constraint for the garbage collector, the processors usually
share a single memory bus, and thus the aggregate memory bandwidth does not
increase in proportion to the number of processors.

These results are likely to be very sensitive to both hardware and OS issues.
Preliminary experiments with an older Pentium Pro machine running an older
kernel were far less encouraging.