Article 8079 of comp.org.decus:

                         The DECUServe Journal
                         ---------------------
                               June, 1997

          From the Editors' Keyboard . . . . . . . . . . . . . 2
            What's inside
          Controlling CPU Consumption  . . . . . . . . . . . . 3
            How to handle "CPU hog" applications
          Problem Copying Backup Saveset . . . . . . . . . . . 8
            Illegal record on tape-to-disk COPY
          Web Page Expiration  . . . . . . . . . . . . . . . . 9
            Specifying an expiration time for a link
          NT 4.0 SP3 and PATHWORKS . . . . . . . . . . . . .  12
            NT fix breaks PATHWORKS authentication
          The Deadly DELNI . . . . . . . . . . . . . . . . .  14
            Pros and (mostly) cons of DELNIs and DEMPRs
          Hooking up to ISDN PRI . . . . . . . . . . . . . .  19
            What do you need between a router and the PRI?
          Laser Printer Error Code . . . . . . . . . . . . .  23
            Meaning of a DEClaser 2150 status code
          Alpha SMP Performance  . . . . . . . . . . . . . .  26
            Observations on multiprocessor systems' performance
          RAID System Disk?  . . . . . . . . . . . . . . . .  29
            Is it safe to use a RAID set as system disk?
          OSF/1 Disk Woes  . . . . . . . . . . . . . . . . .  30
            Get "camlogger" errors after swapping disk
          About the DECUServe Journal  . . . . . . . . . . .  33
          Contact Information  . . . . . . . . . . . . . . .  35


The DECUServe Journal  June, 1997                               Page 2
From the Editors' Keyboard


  From the Editors' Keyboard
  ---- --- -------- --------

  It's June (well, it still is on this coast), and time for another
  serving of DECUServe's best.  On the menu this time are such
  traditional delicacies as OpenVMS tuning arcana and a disagreement
  (civil, of course) over the virtues of certain old pieces of DEC
  networking hardware.  In a slightly more contemporary vein, we also
  have a bit of Web hacking, and an item from the MS_WINDOWS_OS
  conference, which emerged from the old WINDOWS_NT conference with
  its scope broadened to include other Windows platforms.  The "meal"
  is rounded off with a generous helping of hardware hints and a side
  of Unix; just the thing for a summer picnic!  Now, how to run a
  network hookup all the way out to the beach...?

                               * * * * *


The DECUServe Journal  June, 1997                               Page 3
Controlling CPU Consumption


  Controlling CPU Consumption
  ----------- --- -----------

  Abstract:

  A discussion of how tuning parameters will (or won't) help control
  the impact of CPU-intensive applications on a system.


  Participants:

  Gus Altobello, John Briggs, David Campen, Dan Esbensen, Linwood
  Ferguson, Jack Harvey, Alan Hunt, Mark Lasoff, Bart Lederman, Brian
  Tillman.

  Conference:  VMS


Note 2825.0, 24-Apr-1997
Harvey: Controlling CPU Utilization
-----------------------------------
    We have a growing CPU intensive application that is raising some design
    issues.  The application currently lives on a dozen or so "compute
    engines" handling about 120 or so users total.  Roughly ten per engine.
    Disk I/O is negligible.  The current engines are VAXen of about 32 VUPs
    each. The users are using X-windows and the engines are generating the
    displays.

    This is growing, and we aren't sure how best to grow it.  The
    application is designed to run on any number of compute engines.  The
    problem is that one of the ten users per engine can issue commands that
    will suddenly use much of the engine CPU power, blocking other users.

    One approach might be to reduce the total number of engines by
    replacing them with a few (2-3) Alphas of about 400 VUPs each.  The
    idea is that a CPU devourer request from one user won't affect other
    users on the same engine as much, or at least for as long.

    Another approach is to use 120 much smaller VAXen (or maybe PCs),
    assigning only one user per engine.  This would prevent one user from
    stealing CPU power from other users.  The user who issues the CPU
    devouring request will get lousy response, but other users on other
    engines will still be getting good response.

    Both approaches are expensive.  Is there any way we can use VMS to
    bound the amount of CPU that a single user can gobble?  To satisfy
    those who favor one small engine per user, can we limit CPU use by a
    single process to, say, ten percent of the total available?

    I'm interested not only in ideas on how to do this, but in thoughts on
    the choices between the two different approaches described above.


The DECUServe Journal  June, 1997                               Page 4
Controlling CPU Consumption


Note 2825.1, 24-Apr-1997
Campen: 
--------
>    application is designed to run on any number of compute engines.  The
>    problem is that one of the ten users per engine can issue commands that
>    will suddenly use much of the engine CPU power, blocking other users.
    
    Blocking for how long. If all the processes are running at the same
    base priority and this base priority is <= 16 then I don't understand
    why normal VMS scheduling is not giving all users equal access to the
    CPU; if you have ten users each running a compute intensive process
    then each should get 10% of the CPU.
    

Note 2825.2, 24-Apr-1997
Altobello: Some thoughts from a fading brain...
-----------------------------------------------
    I was wondering the same as David.
    
    What I presume is happening is you have several folks doing little
    graphs or something, and one guy kicks off his big compute load.  The
    system gets sluggish then (I've seen this) and the others get
    much-reduced responsiveness.
    
    I'm losing my touch with VMS, but wasn't there a way to tune the amount
    of CPU boost one received for I/O?  If you allow I/O to give a greater
    boost than normal, perhaps you can make up for the CPU hog.
    
    I've also, in the past, played with the quantum and such, making it so
    that the CPU is released more often.
    
    What you appear to be asking is "how do I throttle CPU-bound
    processes", and one way to do that is limit their CPU access and the
    other is to make I/O-bound processes get higher priorities faster.
    
    Finally, if your application knows it's going into a CPU-intensive
    mode, you could possibly have it drop its priority at that time.  If
    you control the apps on the box, this may make things seem "fairer"
    without a lot of effort.
    
    Just some (fading/obsolete) thoughts,
        -gus


Note 2825.3, 24-Apr-1997
Campen: 
--------
    If the case is as Gus describes and quantum is set at its default value
    of 20 (=200 ms) then reducing quantum to perhaps 5 should make the
    system seem less sticky. Reducing quantum will mean a little more CPU
    time is lost to overhead but the default of 20 was chosen a long time
    ago when most machines were 1 VUP.


The DECUServe Journal  June, 1997                               Page 5
Controlling CPU Consumption


>    I'm losing my touch with VMS, but wasn't there a way to tune the amount
>    of CPU boost one received for I/O?  If you allow I/O to give a greater
>    boost than normal, perhaps you can make up for the CPU hog.
    
    I don't known of any way to vary the amount of priority boost a process
    receives for I/O completion. Non-terminal I/O completion gives the
    process a priority boost of 2, terminal output completion a boost of 4
    and terminal input completion a boost of 6. A non-zero value of
    priority_offset would negatively affect the benefits of a priority
    boost so Jack might want to check that this is set to its default value
    of zero.
    

Note 2825.4, 25-Apr-1997
Harvey: Maybe I don't understand what to expect
-----------------------------------------------
    Having the application lower its priority when it starts a big CPU
    gobble might be possible in the long term.  It would require a contract
    negotiation, however. :-(
    
    The SYSGEN value of PRIORITY_OFFSET is zero, so that's not a problem.
    
    However, David raises a point.  Why, under normal VMS scheduling, does
    the hog process block other users?  Yet, as Gus has seen, the blocking
    effect is real.


Note 2825.5, 25-Apr-1997
Lederman: I'm going to suggest the manuals.
-------------------------------------------
    There probably are ways to adjust the system, but I would not go in and
    adjust QUANTUM without a good understanding of what will happen.
    
    May I suggest that you look at the "OpenVMS Performance Management"
    manual, particularly Chapters 9 and 13, and the decision trees in
    Appendix A?  I think you will find useful information there.  There is
    also a feature which appears to be little used in VMS called Class
    Scheduling that allows the system to react differently to some classes
    of programs than to others.  There should be a sample program in
    SYS$EXAMPLES:CLASS.C that says something about it.
    

Note 2825.6, 25-Apr-1997
Briggs: Dynamic priority reduction
----------------------------------
I have two suggestions.

1.  Enable the class scheduler.  Put each process in its own class
    and give the class a maximum of 50% CPU utilization.


The DECUServe Journal  June, 1997                               Page 6
Controlling CPU Consumption


    Unfortunately, from what I remember of the class scheduler implementation,
    this will be bursty -- your gobbler will get lots of CPU for the
    first half of each sample interval and then get chopped off for the
    last half.

2.  There is a piece of freeware that we found at one point.  It monitors
    CPU utilization and imposes a dynamic priority reduction on CPU-intensive
    processes.  It's been a while and I have no idea where to find it.
    But it's not a tough wheel to re-invent.

    This actually worked pretty well and kept the system responsive without
    killing gobbler processes utterly.


Note 2825.7, 25-Apr-1997
Tillman: ForWords shows...
--------------------------
    3RD_PARTY_VMS_SOFTWARE topic 10 contains some discussion on this.  In
    particular, 10.24 speaks about lowering quantum and the effect it can
    have on overhead.


Note 2825.8, 25-Apr-1997
Lasoff: Use DCL
---------------
    Hi Jack.  A quick solution would be to write a DCL procedure (or
    program) that wakes up every so often, checks all processes on the
    system, and if it notices ANY user running program XXX, then it does a
    $SET PROCESS/PRIORITY=0/ID=pid.
    
    You may need to consider raising that pid's priority at a later time if
    the user stays logged-in after running the CPU-intensive application.
    
    This is not elegant, but it will prevent this process from taking over
    the system even with the CPU priority boosts of VMS.


Note 2825.9, 25-Apr-1997
Ferguson: Quantum -- try it
---------------------------
>    However, David raises a point.  Why, under normal VMS scheduling, does
>    the hog process block other users?  Yet, as Gus has seen, the blocking
>    effect is real.
    
    It's real, but I'm not sure why you see it as unexpected.  A lot
    depends on what the other users are doing.  If they are doing lots of
    things that give up the CPU (e.g. keystrokes) you tend to get something
    like 200ms for the hog, 1ms for A (because it lets go), 1ms for B
    (same), 200ms for the hog.  Each time through you wait for the hog
    before you give the process its 1ms, but then it lets go and doesn't
    consume its share of quantum.  The hog always does.


The DECUServe Journal  June, 1997                               Page 7
Controlling CPU Consumption


    200ms is a long, long time to hold a CPU if a lot of people are
    waiting.  Get a couple processes per CPU that will hold it the whole
    200ms and interactive response gets dirt slow.  Someone mentioned older
    vaxen, but also bear in mind batch work use to be more important that
    crisp interactive response (or at least more than now -- people are
    spoiled and expect instant gratification -- I use to remember when a
    785 seemed lightning fast).
    
    Quantum is relatively harmless, drop it way on down and see what you
    think.  Stick it up high to get a real feel for the range of effects.
    You can change it dynamically from moment to moment.
    
    If you do start changing priorities of the COM processes, One other
    thing you might look at if you find long compute queues is to adjust
    PIXSCAN.  It's default of 10 looks through only a few processes for
    ones COM[o] to boost.  Having this too low relative to the number of
    total processes can let one high priority COM process never get
    pre-empted.  It's kind of like the I/O boost of priority on completion,
    to give others at least a chance even if their base priority is lower.


Note 2825.10, 26-Apr-1997
Hunt: Look at CLASS Scheduling
------------------------------
    I would agree with the earlier advice on looking up class scheduling. 
    This is somewhat new so you won't hear much about it.  I believe it may
    do almost exactly what you are asking for by ensuring everyone gets a
    fair share of the system.  You can tailor it and there are hooks for
    you to set up a custom schedule.  Digital may have some other packages
    to help with this as well.


Note 2825.11, 27-Apr-1997
Esbensen: Dynamic Load Balancer is built to do just what you are asking...
--------------------------------------------------------------------------
    Hello,
    
    Our Dynamic Load Balancer is specifically designed to make sure that
    all users get a "fair shake" at the CPU...and to even out "bursty"
    response times.
    
    More information can be seen at:
    
      http://www.ttinet.com/
    
    Dan Esbensen
    Designer of Dynamic Load Balancer...


The DECUServe Journal  June, 1997                               Page 8
Problem Copying Backup Saveset


  Problem Copying Backup Saveset
  ------- ------- ------ -------

  Abstract:

  For some of you, this next short item may fall under the category of
  "I knew that" -- but sometimes a little reminder about those obscure
  little gotchas is in order.  So how many potential problems can you
  think of when using the COPY command to move a BACKUP saveset from
  tape to disk?


  Participants:

  Arnold De Larisch, Linwood Ferguson, Terry Kennedy, Larry Kilgallen.

  Conference:  VMS


Note 2821.0, 13-Apr-1997
Ferguson: Copy backup saveset to disk, illegal record?
------------------------------------------------------
    I thought it was always possible to copy VMS Backup savesets from tape
    to disk by just mounting the tape and using COPY (+/- those continued
    on second tapes).
    
    I've tried it a couple times lately on a new TZ887 drive (not sure if
    that's related) and I get thebelow.  I've done this with multiple tapes
    (though similar in format).  In all cases the tape reads fine if I do a
    BACKUP command to read it.  I've tried it with and without the /BLOCK.
    
    What am I missing?
    
    PS. These were created with 
    
        backup/image device magtape:arc#/save /block=65534
    
    $ mount/over=id magtape/block=65534
    %MOUNT-I-MOUNTED, ARC mounted on _$1$MUA0: (HSJ003)
    $ reca cop 
    $ copy magtape:[000000]*. sys$scratch:/log
    %COPY-E-OPENIN, error opening $1$MUA0:[]ARC3.;1 as input
    -RMS-F-IRC, illegal record encountered; VBN or record number = 0
    %COPY-E-OPENIN, error opening $1$MUA0:[]ARC4.;1 as input
    -RMS-F-IRC, illegal record encountered; VBN or record number = 0
    %COPY-E-OPENIN, error opening $1$MUA0:[]ARC5.;1 as input
    -RMS-F-IRC, illegal record encountered; VBN or record number = 0
    %COPY-E-OPENIN, error opening $1$MUA0:[]ARC6.;1 as input
    -RMS-F-IRC, illegal record encountered; VBN or record number = 0
    

The DECUServe Journal  June, 1997                               Page 9
Problem Copying Backup Saveset


Note 2821.1, 13-Apr-1997
Kennedy: Recordsize problem
---------------------------
  Isn't the maximum RMS recordsize for disk records 32K?


Note 2821.2, 13-Apr-1997
Ferguson: That sounds reasonable
--------------------------------
    Now that could be.  I just kept staring at the "OPENIN" part.
    
    If I can ever get this whole tape restored I'll experiment (it's got a
    ba-zillion little files, and takes foreever to restore, but isn't all
    that large, so I was trying to get each saveset to disk and restore
    them separately).


Note 2821.3, 14-Apr-1997
De Larisch: VMS Copy as a record limit of 32256
-----------------------------------------------
>     I thought it was always possible to copy VMS Backup savesets from tape
>     to disk by just mounting the tape and using COPY (+/- those continued
>     on second tapes).
    
No it's not ... the LARGEST block size that VMS Copy is 32256.
You may want to use SAVESET manager to rewrite the tapes with a smaller block 
size.


Note 2821.4, 14-Apr-1997
Kilgallen: Another limitation on COPY
-------------------------------------
Although not related to the problem you encountered, the COPY method
also requires that there be no retries in the writing of blocks on
the tape.  The only way I know to guarantee that is to make use of
the /INTERCHANGE qualifier (since it was created to facilitate SDC
duplication), but that has the (potentially) undesired side effect
of preventing the transfer of ACL information.


  Web Page Expiration
  --- ---- ----------


The DECUServe Journal  June, 1997                              Page 10
Web Page Expiration


  Abstract:

  Recently, DECUServe has been the site of a fair amount of
  Web-related activity, though some of you may not have noticed.  A
  small group of dedicated volunteers have been working on a project
  to make some of DECUServe's rich content available on the Web.  This
  next exchange concerns one of the "details of implementation" that
  inevitably appear in such undertakings.


  Participants:

  Charlie Byrne, Bob Hassinger, Lynda Peach, Don Vickers.

  Conference:  WWW


Note 119.0, 10-Mar-1997
Vickers: HTML: Web page expiration options
------------------------------------------
    Is there a way to specify an expiration time for a page?  I seem to
    recall a META tag for EXPIRE but cannot find it.

    My goal is to implement some way to help clients of the DECUServe Web
    pages to get the latest copies of the pages.  The various pages are
    generated on a irregular basis.  The /conferences/index.html top level
    page is generated every few hours and each conference's pages are
    generated when a change is found in that conference.  In some cases
    this is in a few hours and in other cases virtually never.

    One approach would be to set some expected 'expiration' time on each
    page as it is generated.  Another might be to force the pages from the
    client and server caches somehow.

    We do not wish to use cookies.  Some of the clients have made it clear
    that cookies are not acceptable and some clients use browsers without
    cookie support.

    Thanks for any ideas and suggestions,
    don


Note 119.1, 10-Mar-1997
Byrne: HTTP for expire, maybe not from HTML
-------------------------------------------
    This is from Lincoln Stein's page,
    http://www-genome.wi.mit.edu/ftp/pub/software/www/
    It may refresh your memory:
    
    Creating the HTTP Header 
    
    Creating the Standard Header for a Virtual Document 


The DECUServe Journal  June, 1997                              Page 11
Web Page Expiration


       print $query->header('image/gif');
    
    This prints out the required HTTP Content-type: header and the
    requisite blank line beneath it. If no parameter is specified, it
    will default to 'text/html'. 
    
    An extended form of this method allows you to specify a status code and
    a message to pass back to the browser: 
    
       print $query->header(-type=>'image/gif',
                            -status=>'204 No Response');
    
    This presents the browser with a status code of 204 (No response).
    Properly-behaved browsers will take no action, simply
    remaining on the current page. (This is appropriate for a script that
    does some processing but doesn't need to display any
    results, or for a script called when a user clicks on an empty part of
    a clickable image map.) 
    
    Several other named parameters are recognized. Here's a contrived
    example that uses them all: 
    
       print $query->header(-type=>'image/gif',
                            -status=>'402 Payment Required',
                            -expires=>'+3d',
                            -cookie=>$my_cookie,
                            -Cost=>'$0.02');
    
    -expires
    
    Some browsers, such as Internet Explorer, cache the output of CGI
    scripts. Others, such as Netscape Navigator do not. This
    leads to annoying and inconsistent behavior when going from one browser
    to another. You can force the behavior to be
    consistent by using the -expires parameter. When you specify an
    absolute or relative expiration interval with this parameter,
    browsers and proxy servers will cache the script's output until the
    indicated expiration date. The following forms are all valid for
    the -expires field: 
    
            +30s                              30 seconds from now
            +10m                              ten minutes from now
            +1h                               one hour from now
            -1d                               yesterday (i.e. "ASAP!")
            now                               immediately
            +3M                               in three months
            +10y                              in ten years time
            Thursday, 25-Apr-96 00:40:33 GMT  at the indicated time & date
    
    When you use -expires, the script also generates a correct time stamp
    for the generated document to ensure that your clock
    and the browser's clock agree. This allows you to create documents that
    are reliably cached for short periods of time. 


The DECUServe Journal  June, 1997                              Page 12
Web Page Expiration


    CGI::expires() is the static function call used internally that turns
    relative time intervals into HTTP dates. You can call it
    directly if you wish. 
    

Note 119.2, 10-Mar-1997
Peach: Date must be GMT.
------------------------
    In addition to Charlie's note ....
    
    The Expires header is used by the proxy server as a mechanism to keep
    caches up-to-date.  Example:
    
    <META http-equiv="Expires" Contents="Thu, 10 Mar 1997 08:00:00 GMT">
    
    The proxy server discards the document at the indicated time.  There is
    one important note -- the exact format of the dat is specified by the
    standard, and the date MUST ALWAYS be in Greenwich Mean Time (MGT).
    
        The above is from Webmaseters Expert Solutions.
    
    Will be very interested to know how this works Don.  
    Lynda


Note 119.3, 10-Mar-1997
Hassinger: 
-----------
    I had intended to answer that question a few days ago - guess I did
    not.

    Yes, you can put an expires in the headers (take care about the
    particular CGI scripting support you are using - the code shown might
    not quite work on VMS executing DCL for example - every platform (and
    server) provides different CGI support (although a good many provide
    pretty similar support)

>    <META http-equiv="Expires" Contents="Thu, 10 Mar 1997 08:00:00 GMT">

    And this points out the potential use of a META tag to effectively
    embed the header information within the HTML code.  The question WRT
    how well you can depended on the META http-equiv working.  My
    impression is that it may not be fully supported by everything.  I am
    not sure what its standardization status is.


The DECUServe Journal  June, 1997                              Page 13
NT 4.0 SP3 and PATHWORKS


  NT 4.0 SP3 and PATHWORKS
  -- --- --- --- ---------

  Abstract:

  A report of a problem with PATHWORKS 4 SMB authentication after
  applying an NT service pack, how to fix it, and how to recover when
  the published fix doesn't work.


  Participants:

  Kevin Angley, Paul Flaherty Jr.

  Conference:  MS_WINDOWS_OS


Note 285.0, 16-May-1997
Angley: NT4.0 SP3 breaks Pathworks 4 SMB authentication
-------------------------------------------------------
    NT 4.0 SP3 having now been released ( 
    ftp://ftp.microsoft.com/bussys/winnt/winnt-public/fixes/usa/nt40/ussp3),
    I installed it on my NT 4.0 workstation. Now, I am unable to connect to
    Pathworks 4 file services using the \\node\share%user construct. The
    error says that I am unable to login from this workstation.
    
    The release notes indicate changes in SMB authentication and reference
    two knowledge base articles. Both article numbers they give (Q161372
    and Q166730) are wrong.
    
    Anyone know what registry hack is necessary to fix this problem?


Note 285.1, 20-May-1997
Angley: Registry entry doesn't fix it for me
--------------------------------------------
    The Knowledge Base article finally became available. It says to hack
    the registry as follows:
    
    Run Registry Editor (Regedt32.exe).
    From the HKEY_LOCAL_MACHINE subtree, go to the following key:
    \system\currentcontrolset\services\rdr\parameters
    Click Add Value on the Edit menu.
    Add the following:
          Value Name: EnablePlainTextPassword
          Data Type: REG_DWORD
          Data: 1
    Click OK and then quit Registry Editor.
    Shut down and restart Windows NT.
    
    I did so, and rebooted. It did not correct the problem. DEC DSN says
    that this should have corrected the problem. Anyone else seeing this?


The DECUServe Journal  June, 1997                              Page 14
NT 4.0 SP3 and PATHWORKS


Note 285.2, 22-May-1997
Flaherty: Same problem with SAMBA
---------------------------------
I had the same problem connecting to SAMBA for VMS, but the above hack
fixed it.


Note 285.3, 23-May-1997
Angley: Works for most
----------------------
    Apparently this fix works for most people. On a particular workstation
    of mine, it did not fix it. I was, however, able to fix it by
    recovering the rdr.sys file from the uninstall directory.


  The Deadly DELNI
  --- ------ -----

  Abstract:

  We now date ourselves when we admit to having experience with
  certain kinds of networking hardware -- and DELNIs and DEMPRs are
  surely on that list.  The following discussion concerns a plan for
  replacing some "vintage" equipment, and even (perhaps surprisingly)
  flushes out some defenders of the old boxes.


  Participants:

  Gus Altobello, Linwood Ferguson, Jack Harvey, Ken Johnson, Terry
  Kennedy, Milton Lopez, Norm Raphael, Brian Tillman.

  Conference:  DEC_NETWORKING


Note 1290.0, 8-May-1997
Lopez: Die, DELNI, die!
-----------------------
Will the following configuration work (DS200 = DECserver 200)?


                     |
      DS200----T-----|  T = AUI - BNC transciever
      DS200----T-----|
      DS200----T-----|
                     |
      DEMPR----T-----| <- Thinwire backbone


The DECUServe Journal  June, 1997                              Page 15
The Deadly DELNI


                     |
      uVAX-----------|
                     |
      Cat5 hub-------|  This will be added as part
                     |  of a migration to Cat5


Just for background I still have the DS200's hanging off of a DELNI
(which I would like to get rid of) like this:

              H4000          H4000
        --------O---------------O------ Thickwire backbone
                |               |
                |               |
                |               |
              uVAX            DELNI
                              | | |----DS200   (Some terminals
                              | | |----DS200    still in use)
                              | |------DS200
                              |          .
                              |          .
                            DEMPR
                            | | |----
                            | | |----   Thinwire  (lots of
                            | |------   segments   PCs)
                            |


Note 1290.1, 8-May-1997
Johnson: Looks good
-------------------
The proposed new configuration looks much better than the old.  Our
configuration is smilar to your new configuration, except that our multiport
thinwire repeaters are from Allied Telesis, not Digital.


Note 1290.2, 9-May-1997
Harvey: Go for it
-----------------
    The new configuration looks good, but I don't see it as "much better"
    than the old, which looks good to me, too.  What's wrong with it, Ken?


Note 1290.3, 9-May-1997
Kennedy: 
---------
  You can't hang repeaters/hubs off DELNI's. Repeat after me: "DELNI's are
an abomination". They're "multiport AUI fanout devices", a category of
device that isn't mentioned in the 802.3 specs.


The DECUServe Journal  June, 1997                              Page 16
The Deadly DELNI


Note 1290.4, 9-May-1997
Johnson: Our experience
-----------------------
We used to have cascaded DELNIs, with a multiport repeater hanging off of the
top-level DELNI.  When we switched to a configuration with the MPR as the 
backbone, our Ethernet network was more reliable.  Later, when we needed more
Thinwire legs, we went to a Thinwire backbone, with MPRs off of that, similar
to the proposed new configuration. 


Note 1290.5, 11-May-1997
Harvey: Bean counting engineering
---------------------------------
>  You can't hang repeaters/hubs off DELNI's. 
    
    Nonsense.  I've done it for years.  So what if they aren't mentioned in
    the specs?  Why does the spec make the electrons dance nicer?


Note 1290.6, 11-May-1997
Kennedy: 
---------
  There are a very few number of specific circumstances (one?) where you can
do it. It involves an EtherCork (DEC test adapter) in the global port, the
DELNI in global mode (but not connected to anything upstream) and a single
level of Ethernet (not 802.3) repeater/hub devices connected.

>  So what if they aren't mentioned in
>     the specs?  Why does the spec make the electrons dance nicer?

  I find this very odd coming from you, given the "must absolutely be a 
supported configuration" message you're sending about your environment
elsewhere on DECUServe.

  I doubt lima beans are mentioned in the spec either, and using them will
break your network 8-) Seriously, the spec states what designers have to
do to be interoperable, and what sorts of configurations are valid. You
can occasionally break the rules and get away with it.

  What makes the DELNI so evil is that it will often appear to work just
fine, until traffic grows or some apparently unrelated change is made
elsewhere on the network. Also, there are a lot of subtle issues with the
DELNI that don't get factored in (for example, what's the equivalent tran-
sceiver cable length of a DELNI?) that can cause you to unknowingly build
an invalid configuration. Again, that's beyond the other aspects of the
DELNI that make them unsuitable.

  The DELNI was a good solution for its time - when thickwire was the only
Ethernet medium and transceivers cost hundreds of dollars. It made sense
to use a DELNI as a "virtual backbone". For that time and those slower
systems, it made lots of sense and saved money. When the attached systems
got faster, when the backbone de-virtualized, and when the world switched


The DECUServe Journal  June, 1997                              Page 17
The Deadly DELNI


to 802.3, the DELNI was no longer a good solution. For a trivial amount of
money, you can dump the DELNI - just get up to 8 $29 (list) Allied Telesyn
MX20 TP transceivers, 8 $3 CAT 5 cables, and a $200 twisted pair hub.


Note 1290.7, 11-May-1997
Harvey: Hey, lighten up!
------------------------
>Seriously, the spec states what designers have to
>do to be interoperable, and what sorts of configurations are valid. 

    Terry, this is just not true.  The spec gives bean counter rules for
    people who can't (or don't want to) use their engineering common sense. 
    The electrons don't know about the spec.

>You can occasionally break the rules and get away with it.
         ^^^^^^^^^^^^
    More nonsense.  It depends on the rule.  You can't get a transceiver to
    work without applying power, I agree.  But how about VSWR?  How about
    cable length?  How about the number of taps?  How about the number of
    repeaters?  How about the spacing between taps?  How about cable
    characteristic impedance?  How about wire gauge for 10BaseT?  You know
    perfectly well these rules can all be violated.  You have probably
    violated every one yourself.

    I agree the rules are good guides, and prudent engineering should
    attempt to follow them.  However, engineering is also getting the job
    done.  You seem to advocate (What makes the DELNI so evil...) junking
    lots of perfectly good hardware because you have had a problem with it. 
    Is that good advice?

>  I doubt lima beans are mentioned in the spec either, and using them will
>break your network 8-) 

    Beans?  Galileo fell for that, too, so you're in good company.  :-)


Note 1290.8, 11-May-1997
Ferguson: We're trashing our DELNI's, want some?
------------------------------------------------
>    Terry, this is just not true.  The spec gives bean counter rules for
>    people who can't (or don't want to) use their engineering common sense. 
>    The electrons don't know about the spec.
    
    Jack, perhaps you have a very unique environment where everyone who
    must maintain your network understands these from an engineering
    standpoint well enough to second guess the rules.
    
    We don't.  And while I've not had nearly the experience with DELNI's
    Terry has (or anything else probably), most of our problems have come
    from 2nd or 3rd order changes, where someone "cheats" and does so
    intelligently, then along comes someone else and does something that


The DECUServe Journal  June, 1997                              Page 18
The Deadly DELNI


    appears perfectly valid and everything breaks.  E.g. pulling out the
    loopback connector on a standalone DELNI and switching it out of global
    mode "because I needed a loopback connector and obviously it wasn't
    serving a purpose".
    
    Performance and heartbeat issues are another area.
    

Note 1290.9, 13-May-1997
Altobello: DELNI/DEMPR works fine, if you can handle the "rules"
----------------------------------------------------------------
    Here we have used the DELNI/DEMPR combination extensively over the
    years, and it has worked well for us.  Yes, if you don't disable
    heartbeat (SQE) you can have problems.  Yes, if you cascade the DELNIs
    you have problems.
    
    If you stick a scissors into a wall socket, you have problems, too.
    
    There are specific rules you must follow to get DELNI/DEMPR combos to
    work, and in my experience, and in the experience of others of my
    colleagues here, it has been a perfectly workable combination.
    
    As far as getting "strange problems that are hard to diagnose", our #1
    candidate for that was having a single Ethernet backbone.  This allowed
    any transceiver that went wacky to drop the whole backbone and was hard
    as the dickens to troubleshoot.  That configuration was corrected many
    years ago, five or more.  And since the collapsed backbone that was
    used then had numerous bridged Ethernet segments, we pulled the DELNIs
    and connected the DEMPRs right up.
    
    I suspect there are still DELNI/DEMPR setups somewhere in the network,
    though we have long since gone with other configurations.
    
    But the pair did work, and when it didn't it was usually either someone
    didn't follow some simple rules, or it was the more general problem of
    having a broadcast medium hung all over the place.
    

Note 1290.10, 15-May-1997
Lopez: As I was saying ...
--------------------------
Ok, Ok, so the DELNI is not demon-possesed. May ask another question?
A few notes back in this thread someone talked about conecting a terminal
server directly to a hub using a transciever. This sounds better than my
original suggested config, although it does take up ports on the hub. So,
will this work:

DECServer200 -- Transciever -- Cat5 -- Hub port ?


The DECUServe Journal  June, 1997                              Page 19
The Deadly DELNI


Note 1290.11, 15-May-1997
Raphael: I think so
-------------------
I believe we are doing this now quite comfortably.


Note 1290.12, 15-May-1997
Kennedy: Yup
------------
  Yes. If you turn off heartbeat on the transceiver (most non-DEC ones I've
seen come this way by default) you'll see a warning on the terminal server
(in "show server status" - something like "self-test status: 08-00-00" as I
recall) because the server is Ethernet II, not 802.3. This is harmless and
doesn't affect anything (or you could always enable heartbeat).


Note 1290.13, 19-May-1997
Lopez: Cool!
------------
Thanks again. I can now justify my DECUServe subscrition renewal ... ;)


Note 1290.14, 20-May-1997
Tillman: Another symptom
------------------------
    You'll also notice that the green light blinks instead of being a
    steady green when "heartbeat" is disabled.


  Hooking up to ISDN PRI
  ------- -- -- ---- ---

  Abstract:

  Suppose you have a router with an ISDN PRI (Primary Rate Interface)
  module, and a connection installed by your local telco -- which is
  to say, a jack in the wall.  What do you need to connect things up?
  A cable?  A CSU?  And what exactly does a CSU do, anyway?


  Participants:

  Harris Berkowitz, Linwood Ferguson, Terry Kennedy.

  Conference:  DEC_NETWORKING


The DECUServe Journal  June, 1997                              Page 20
Hooking up to ISDN PRI


Note 1289.0, 1-May-1997
Berkowitz: PRI with CISCO 4700
------------------------------
What is generally used in between the ISDN PRI module on a Cisco 4700 and 
the telco jack?

My Cisco vendor first recommended an AT&T (Lucient) CSU in the $2200 range.
They said that there's a less expensive Adtran solution but haven't heard
back with the details yet.

Anyone else doing PRI with Cisco?


Note 1289.1, 2-May-1997
Kennedy: Replace your vendor
----------------------------
  Just a cable, Cisco part number CAB-7KCT1DB15, $100 list price. You can
make up the cable yourself (we do) - all you need is a DB-15 and a Cat 5
patch cable. Whack one end of the Cat 5 cable off, put the white/blue pair
on DB-15 pins 1 and 9, and the orange/white pair on DB-15 pins 3 and 11.
Polarity (which pin gets the white wire) doesn't matter, though you may
have to swap the 1/9 and 3/11 sets (moving white/blue to 3/11 and so forth).

> My Cisco vendor first recommended an AT&T (Lucient) CSU in the $2200 range.
> They said that there's a less expensive Adtran solution but haven't heard
> back with the details yet.

  I think you need a new vendor who can at least read the Cisco catalog.

  By the way, you can get T1 CSU's (not that this configuration needs one)
for $800 or so - $100 less if you don't need fractional T1.

> Anyone else doing PRI with Cisco?

  No, but we're using the MIP (the Cisco 7xxx version of your 4xxx card)
for T1's which uses identical hardware and cabling as ISDN PRI.


Note 1289.2, 7-May-1997
Ferguson: Really confused, need education before new vendor
-----------------------------------------------------------
    Terry, can you elaborate a bit.  There's a bit more to this story.
    
    I looked in the Cisco CD catalog, and it sure appeared to show that you
    needed a CSU (it just said CSU, nothing about DSU).
    
    The vendor that wanted $2200 for a CSU is what first set bells ringing,
    since we were already getting Crays for far less than that, and had
    several sitting around.  Except those were CSU/DSU's if I'm using the
    right terminology.
    
    After your note, I asked Cisco's TAC, telling them we had some


The DECUServe Journal  June, 1997                              Page 21
Hooking up to ISDN PRI


    seemingly conflicting info whether we just needed a cable or CSU or
    both.  Their very terse response was that we needed a CSU (nothing
    about any particular types).
    
    Maybe what we need is an education (and maybe our vendor does as well,
    but I can't help him).  My limited understanding was that the CSU and
    DSU are really separate things that often come in the same box, and
    that the PRI interface on the Cisco takes care of the DSU
    part. But not sure about the CSU part, as I'm a bit weak on what
    exactly a CSU does.
    
    Then there's the simple issue that the Cisco catalog expects a 15pin to
    15pin cable from the PRI interface to the CSU, and any CSU/DSU we have
    has a V35 cable -- another reason I'm assuming that we need a CSU-only
    not a CSU/DSU.
    
    Can you or someone elaborate a bit on how this stuff all works, and (if
    it really is) how a CSU can be optionally replaced with a cable?   Am I
    right in my highly vague idea of the CSU/DSU portions (and might
    someone give a better definitino of what each portion actually does)?


Note 1289.3, 8-May-1997
Kennedy: Gory details explained
-------------------------------
>     Terry, can you elaborate a bit.  There's a bit more to this story.

  Sure.

>     I looked in the Cisco CD catalog, and it sure appeared to show that you
>     needed a CSU (it just said CSU, nothing about DSU).

  That is indeed what the catalog says. However, I've never used a CSU with
these and nobody (among the other ISP's I know) has either. More on this
below.

>     The vendor that wanted $2200 for a CSU is what first set bells ringing,
>     since we were already getting Crays for far less than that, and had
>     several sitting around.  Except those were CSU/DSU's if I'm using the
>     right terminology.

  Right - a DSU adapts an interface (for example, V.35) to the physical trans-
mission channel (like a T1), while a CSU provides isolation and supports
diagnostic functions like loopback and the T1 performance monitoring counters.

>     After your note, I asked Cisco's TAC, telling them we had some
>     seemingly conflicting info whether we just needed a cable or CSU or
>     both.  Their very terse response was that we needed a CSU (nothing
>     about any particular types).
>     
>     Maybe what we need is an education (and maybe our vendor does as well,
>     but I can't help him).  My limited understanding was that the CSU and
>     DSU are really separate things that often come in the same box, and


The DECUServe Journal  June, 1997                              Page 22
Hooking up to ISDN PRI


>     that the PRI interface on the Cisco takes care of the DSU
>     part. But not sure about the CSU part, as I'm a bit weak on what
>     exactly a CSU does.

  Here's the scoop: First, you can't use a DSU (or CSU/DSU) with the chan-
nelized T1 products because the CT1 cards need access to various parts of
the T1 protocol that are stripped out by a DSU. Remember, a DSU provides
you with (assuming B8ZS/ESF T1's) 1536KB as one "lump", or some subrate of
that if you're using fractional T1. It doesn't provide delineation of the
individual 56 or 64KB slots.

  Next, the CT1 cards actually speak "DSX-1", which is the way T1's show
up at central offices, colocation points, and out of things like M13 mux-
es. Some things that speak DSX-1 don't bother implementing things like
loopback or the performance monitoring registers (since these devices are
intended for CO or colocation use, they don't have to comply with end user
requirements). The CT1 cards *do* support loopback and the performance
monitoring registers.

  The other issue is that customer T1 spans can be delivered "wet" or "dry".
"Wet" implies that there's simplex power +/- 130V on the cable for powering
repeaters, etc. that the telco might need between them and your site. Almost
all modern telco-supplied T1's have some sort of network interface unit at
the demarcation point which can be looped up/down by telco control, as well
as stripping out any simplex power.

  Since telcos will usually test to this NIU and if it tests good, they say
"problem's in your cable or equipment", so the diagnostic facilities that
might not be in another DSX-1 device (but *are* in the CT1) don't really 
matter.

>     Then there's the simple issue that the Cisco catalog expects a 15pin to
>     15pin cable from the PRI interface to the CSU, and any CSU/DSU we have
>     has a V35 cable -- another reason I'm assuming that we need a CSU-only
>     not a CSU/DSU.
>     
>     Can you or someone elaborate a bit on how this stuff all works, and (if
>     it really is) how a CSU can be optionally replaced with a cable?   Am I
>     right in my highly vague idea of the CSU/DSU portions (and might
>     someone give a better definitino of what each portion actually does)?

  Hopefully the above clarifies things. I'd have no qualms whatsoever about
just making up the cable as I've described and using it - I've done this with
the Cisco 7xxx MIP card (which is the same as the 4xxx CT1 except it's for
the 7xxx and has 2 T1 ports instead of 1). At our colocation facility in
NYC, there are 3 other ISP's with MIP cards (a total of 15 more MIP cards -
30 T1's) all doing this.

  If you want a cable for this, drop me a note and I'll make one up and send
it to you. Just let me know how long you want it. You might also want to
make sure that the T1 the phone company is giving you doesn't have simplex
voltage on it (you might be able to see the setting if it's in a clear case
- look for options like "SPAN power to CPE", etc. or call your phone company
and ask - it should be marked in their records. If you have a data installer


The DECUServe Journal  June, 1997                              Page 23
Hooking up to ISDN PRI


coming out regularly, he'll be able to check and let you know. Or you can
use a meter (set to a DC range >= 300V) and check the jack - no pin should
have more than about 6V to any other pin. Remember, you're looking at a high
voltage here, so be careful. [This is one of the reasons T1's usually *don't*
get handed to customers with simplex power on 'em.]


Note 1289.4, 19-May-1997
Ferguson: One day we'll find a way to stump Terry, but not this time
--------------------------------------------------------------------
>patch cable. Whack one end of the Cat 5 cable off, put the white/blue pair
>on DB-15 pins 1 and 9, and the orange/white pair on DB-15 pins 3 and 11.
>Polarity (which pin gets the white wire) doesn't matter, though you may
>have to swap the 1/9 and 3/11 sets (moving white/blue to 3/11 and so forth).
    
    Bingo.


Note 1289.5, 19-May-1997
Kennedy: 8-)
------------
>         -< One day we'll find a way to stump Terry, but not this time >-

  But you're having so much fun trying... 8-)

>     Bingo.

  Out of fuel, or working? [From a computerist pilot, it's hard to tell]


Note 1289.6, 20-May-1997
Ferguson: I never understood the term "bingo fuel" anyway
---------------------------------------------------------
    It's working.
    
    I'm running out of fuel rapidly; sleep must come sometime this week (I
    hope).
    
    So both interpretations are valid.
    
    On a related subject as to "working".  The US West folks called me
    back.  Now remember that I had this connected and had calls going in
    both directions just fine.  They decided that it was mis-configured. 
    We were configured for "NI2" (who knows why, they say that's what we
    ordered, which is possible), but that our router can't do NI2 (which I
    think is correct), so that it cannot be working.  So they want to 
    change it to be "custom" so it will work.  Except its working.
    But "it can't work that way".  Sigh.  Anyone want to take bets on 
    what happens after they "fix" it?
    

The DECUServe Journal  June, 1997                              Page 24
Laser Printer Error Code


  Laser Printer Error Code
  ----- ------- ----- ----

  Abstract:

  The mysteries of the "50 SERVICE" message displayed on DEClaser 2150
  (and other) laser printers are revealed in the following notes,
  including recommended corrective actions.


  Participants:

  Rob Aldridge, Joe Gallagher, Terry Kennedy.

  Conference:  HARDWARE_HELP


Note 2141.0, 5-May-1997
Gallagher: What is meaning of DEClaser 2150 error messages?
-----------------------------------------------------------
When a DEClaser printer gives an error message of the form "NN SERVICE"
where NN is between 50 and 99, does anyone know the meaning of these
messages.


Note 2141.1, 5-May-1997
Kennedy: What's the exact code?
-------------------------------
  If these are the printers that are descended from Canon engines (a la
the HP LaserJet II - if it takes a HP II cartridge it's one of these),
they indicate various problems - but you knew that. If you post the
exact code, I'll look it up in my HP/Canon service guides (this assumes
DEC didn't just change the codes to be perverse).

  Some sample malfunctions are fuser too hot/cold, DC power problems.
etc.


Note 2141.2, 6-May-1997
Gallagher: Error code is "50 SERVICE"
-------------------------------------
   Yes, the DEClaser 2150 uses the Canon engine and has the same "guts"
   as an HP II.

   The error code is "50 SERVICE".  And thanks for any help you can 
   give.


The DECUServe Journal  June, 1997                              Page 25
Laser Printer Error Code


Note 2141.3, 7-May-1997
Aldridge: You may need outside help to fix error 50
---------------------------------------------------
    From an HP 4 online manual:
    50, 57 or 58 SERVICE        The printer identified an internal service
    error.      If any of these errors appears, switch the printer off and
    then back on. If the error continues, call you dealer or HP service
    representative.Note   To clear the 50 SERVICE error the printer must be
    off for at least 10 minutes.
    
    From a Laserjet Plus online manual:
    50, 51, 52, 53, 54, 55, 60, 61, 62, 63, 64, 65, 67  Operational error 
    Contact HP Service
    
    
    Just fyi - the Microsoft Technet subscription CD-ROM has an Ultimate
    Printer Manual - which provides on-line manuals for most of the HP and
    some other printers.
    

Note 2141.4, 7-May-1997
Gallagher: Thanks for help
--------------------------
Thanks for the infomation.  I was expecting to have to take the 
printer in for service.  However, I was hoping for some understanding
of the problem.  The manuals are not very forthcoming with detailed
information.  Perhaps there is not _THAT_ much on board diagnostic in
these older printers; the printer diagnostics knows something is wrong, 
but it may not be able to tell very much about what is really wrong.

Again, thanks.


Note 2141.5, 7-May-1997
Kennedy: 
---------
  50 SERVICE is a fuser malfunction. The troubleshooting table says:

1) Is the fusing assembly correctly seated onto its connectors on the
   AC Power Mudule and base plate (left and right ends)?

2) Is the +24A (sic) voltage present?

3) Is the circuit breaker on the AC Power Module tripped?

4) Is the Thermistor defective?

5) is the Fuser Bulb open?

6) is the Thermoprotector open?

7) Are the cable assemblies defective?


The DECUServe Journal  June, 1997                              Page 26
Laser Printer Error Code


8) Is the AC Power Module defective?

9) Is the DC Controller PCA defective?

  The troubleshooting info runs for just 5 pages, so if you send me your
FAX number, I'll FAX it to you.

  Most "printer repair" places (at least around here) are crooks. You're
probably better off fixing it yourself (particularly since it's a DEC
variant of the unit). All of the possible causes for this problem are
generic parts, and you can get them from a place called Parts Now! (see
http://www.partsnowinc.com). A rebuilt fuser (with excange) is about $29-
$39, depending on whether you want new cosmetic parts. They can also do
exchanges on the other parts, or sell you individual fuser parts if you
want.


  Alpha SMP Performance
  ----- --- -----------

  Abstract:

  The following discussion of Alpha (and VAX) performance issues on
  SMP multiprocessor machines split off from another topic (Note 2125,
  on a rather arcane matter of switching CI-based systems between
  production and development environments) in the same conference.


  Participants:

  David Campen, Jack Harvey, Larry Kilgallen, Glenn Zorn.

  Conference:  HARDWARE_HELP


Note 2127.0, 7-Apr-1997
Campen: How well does Alpha SMP perform?
----------------------------------------
>                               -< HARDWARE_HELP >-
>=========================================================================
>Note 2125.0                       CI Bus Switch                        
>EISNER::HARVEY "Jack at SIAC"                        53 lines  
>-------------------------------------------------------------------------
>
>    So far, so good.  For this phase, the two new hardware nodes (Alpha
>    4100s with 4 CPUS and 1 GB memory, if you must know) will become NI
>    cluster members of an existing VAX cluster at V6.2.  They will have
    
    I'm curious, have any benchmarks been done to determine how many users
    a 4 CPU system will support vs. a 1 or 2 CPU system.


The DECUServe Journal  June, 1997                              Page 27
Alpha SMP Performance


Note 2127.1, 12-Apr-1997
Harvey: What's a user?
----------------------
    Oh, we had about five users with two CPUs, and it went up to about
    eight with four.

                                :-)

    [page break]

    Sorry to fool around.  Seriously, we are so far from a typical shop
    that the concept of number of users simply doesn't arise here.  There
    are about eight different types of interfaces and a single user might
    be active on all eight at once.  About 40 nodes are supporting those
    interfaces: X-terminals, VT420's, printers, wireless PCs.  Maybe a
    thousand people in one huge room, where whistles and hand signals are
    used as much as Ethernet...

    So far, our use of the 4100s has been limited to acting as Sybase
    database servers.  The only direct login users are operators.  The
    biggest challenge in using a four CPU node is getting Sybase to keep
    all four doing useful work.  It seems to prefer to contemplate its
    navel.


Note 2127.2, 12-Apr-1997
Zorn: Users?
------------
    Jack is right in .1 what is a user...
    
    Our biggest problem with SMP boxes is CPU 0 hitting 70 to 100% on the
    interrupt stack. (This is supposed to be fixed in V7.1 which will allow
    for more of the IO to be performed on any CPU) Once the interrupt stack 
    hits about 50 or 60% performance actual will start to drop on the system 
    due to overhead synching. I can actually map the kbytes tranfered
    dropping as the interrupt stack increases. 
    
    On our 8480s with 6 Gbytes of memory I can handle about 250 clients who
    run an average of 2 processes each. Most of this is contrained by VMS
    and memory limitations of V6.2 in configuring thier virutal address set
    to 1.5 million pages. Again hopefully V7.1 will lift this and then
    adding in more memory would lift that number. The systems are currently
    bound by the above two factors when fully loaded.
    
    
    Note: The majority of the IO is going through 4 CIPCAs to HSJ
    controllers for disk read/write.


The DECUServe Journal  June, 1997                              Page 28
Alpha SMP Performance


Note 2127.3, 12-Apr-1997
Kilgallen: Buffer Objects may help
----------------------------------
Some applications will reduce such inner-mode overhead by switching
to use buffer objects instead of traditional SYS$QIO.  As I recall
there were supposed to be enhancements to this in V7.1.


Note 2127.4, 13-Apr-1997
Campen: Limitations inherent to Alpha.
--------------------------------------
>Some applications will reduce such inner-mode overhead by switching
>to use buffer objects instead of traditional SYS$QIO.  As I recall
>there were supposed to be enhancements to this in V7.1.

The above is a problem which limits SMP performance gains and is common to both
VAX and Alpha implementaions of VMS. Perhaps this will be improved in VMS 7.1.

The Alpha architecture, I believe, has its own limitations which I expect will
limit SMP performance improvements no matter what is done to the Operating
System. Consider multiple threads of an application or multiple applications 
accessing a common data structure. To insure that the common data seen by the
threads or applications is consistent it is necessary to execute Memory Barrier
instructions to flush the pipelines and memory caches on the CPUs but it is
exactly these pipelines and memory caches that give the Alpha RISC architecture
its performance.


Note 2127.5, 13-Apr-1997
Kilgallen: Why is VAX slow ?
----------------------------
> The above is a problem which limits SMP performance gains and is common to
both
> VAX and Alpha implementaions of VMS. Perhaps this will be improved in VMS 7.1.

I am not sure that Buffer Objects are available to customers on VAX.

> The Alpha architecture, I believe, has its own limitations which I expect will
> limit SMP performance improvements no matter what is done to the Operating
> System. Consider multiple threads of an application or multiple applications 
> accessing a common data structure. To insure that the common data seen by the
> threads or applications is consistent it is necessary to execute Memory
Barrier
> instructions to flush the pipelines and memory caches on the CPUs but it is
> exactly these pipelines and memory caches that give the Alpha RISC
architecture
> its performance.

The same problem exists on any SMP computer.  You do not see it on VAX
because it is arbitrated in hardware.  That is one of the many factors
which makes VAX slower than Alpha.  What Alpha has done is expose these
considerations to software so the flushing operations only take place


The DECUServe Journal  June, 1997                              Page 29
Alpha SMP Performance


when absolutely necessary.

The answer to this situation is a common one which has been around for
years -- careful application design.


  RAID System Disk?
  ---- ------ -----

  Abstract:

  Does the idea of making your system disk a RAID set make you at all
  nervous?  Would you be less nervous if you could ask trusted peers
  and experts about it first?  That's what DECUServe's there for....


  Participants:

  David MacLean, Bill Norton, Keith Parris.

  Conference:  HARDWARE_HELP


Note 2134.0, 17-Apr-1997
Norton: RAIDset for system disk?
--------------------------------
    I'm running out of empty disk slots on my HSD30, and wondering about
    using a 3-disk RAID5 set of RZ29's as - gasp - the system disk.
    Has anyone tried this?
    Is performance really as bad as "common knowledge" advises?
    How about if the page & swap files are off the system disk - would it
    still be unthinkable?


Note 2134.1, 17-Apr-1997
Parris: Go for it
-----------------
At one of my current client sites there are multiple system disks which are HSJ
controller-based stripesets (2 and 3 members) of 2-member mirrorsets, and they
work fine.  VMS just thinks these are large disks, and it's very handy when you
have several large-memory nodes in a cluster with large dumpfiles to store.
(The client doesn't have RAID-5 keys or I might well have used that.)  The
controller's write-back cache (which is a prerequisite for either RAID-5 or
mirroring anyway) tends to basically hide the latency of writes (unless and
until you get to the point where the drives in the array behind the controller
get saturated), and the extra spindles in either type of array will help with
the performance of reads. 

System disks tend to have a fairly small percentage of writes.  If you're
worried, you could always move most of the read/write files (page, swap,


The DECUServe Journal  June, 1997                              Page 30
RAID System Disk?


SYSUAF, queue files, operator logs, accounting files, etc.) off the system disk
first, to maximize the read/write ratio.

Particularly if you leave the page/swap files on the system disk, but even in
general, considering that PFCDEFAULT is 64 blocks by default, you should be
sure to raise MAXIMUM_CACHED_TRANSFER_SIZE on the unit from its default size of
32 so that page faults (PFCDEFAULT in size) or modified page writer writes (127
blocks) don't bypass the cache.


Note 2134.2, 18-Apr-1997
MacLean: I'm doing it, and enjoying it
--------------------------------------
    I've run my production nodes (dual VAX 7630) with a RAID5 system disk,
    via CI-connected dual HSJ40 in SW500 box for the last couple of years,
    and have not seen any performance problems because of the (HSJ-based)
    RAID5, other than when I turned off WRITEBACK cache for a few minutes
    (and that was very sluggish).
    
    While most of our user files and database stuff resides on other (total
    of six) RAID5 sets, most of my PAGE (and SWAP, not that it gets used)
    files are off the system disk, on non-RAID volumes.


  OSF/1 Disk Woes
  ----- ---- ----

  Abstract:

  In the following, we begin with a disk failure on an OSF/1 system.
  The vendor sends a replacement drive, the drive is installed, and
  the system refuses to come back, preferring to spew cryptic error
  messages at startup and then hang.  Now what?


  Participants:

  Bruce Bowler, Dale Coy, Mike Miller.

  Conference:  UNIX_OS


Note 350.0, 1-Apr-1997
Bowler: cam_logger errors installing OSF/1
------------------------------------------
    Situation...
    
    Alpha "clone" - disk goes bad.  To remove the disk, I had to remove all
    of the cards to drop the cage the drive was in to get to the mounting
    screws on "the other side".  Put everything back together with a new


The DECUServe Journal  June, 1997                              Page 31
OSF/1 Disk Woes


    drive in place.  Boot to the OSF installation CD.  Tell it I want to do
    a "basic" installation.  It asks a few questions about which disk is to
    be the system disk (rz3).  Then the dia(mono?)log goes something like 
    this...
    
    initializing the system disk
          working
    system disk has been initialized
    checking root file system
    cam_logger: CAM_ERROR packet
    cam_logger: bus 0 target 3 lun 0
    ss_perform_timeout
    timeout on disconnect request
    cam_logger: CAM_ERROR packet
    cam_logger: bus 0 target 3 lun 0
    ss_perform_timeout
    timeout on disconnect request
    Reached max abort count scheduled bus reset
    cam_logger: CAM_ERROR packet
    cam_logger: bus 0
    aha_bus_reset
    Resetting the SCSI bus at request
    
    Then everything hangs...  The "activity" led on rz3 is on steady at
    this point.  Apparently the only way to clear it when it gets to this
    point is to power cycle the machine.  The original drive was a DEC
    DSP3210.  The new drive is brand x yy3210 (sorry, I'm not at the
    machine right now and don't have the info written down.)
    
    Any ideas on how to get the system installed on this disk?


Note 350.1, 1-Apr-1997
Bowler: 
--------
    A couple other notes that may help...
    
    The SCSI chain looks like this...
    
        Adaptec AHA 1740/42a+---+disk+----+tape+----+CDROM+----+term pack
        (SCSI ID)                 3         6          4
    
    There is a bank of 3 sets of 10 holes next to where the scsi ribbon
    cable connects to the adapatec card that look like they might be a
    place to put the little termination resistors, but they're empty (and
    they were before too).  I'm wondering f there's a termination issue
    here, but I don't knoe enough about SCSI to know for sure. 
    
    I can get to the CDROM drive with no problem... Don't know about the
    tape drive, but when the system is initting the SCSI during power up
    the lights light on the front of it so I'm pretty sure the controller
    "sees" it.


The DECUServe Journal  June, 1997                              Page 32
OSF/1 Disk Woes


Note 350.2, 1-Apr-1997
Coy: Get one
------------
>    The SCSI chain looks like this...
>    
>       Adaptec AHA 1740/42a+---+disk+----+tape+----+CDROM+----+term pack
>       (SCSI ID)                 3         6          4
    
    I presume that's exactly the way it's cabled -- and presume that all of
    the drives are INternal (hard to tell from your description).
    
>    There is a bank of 3 sets of 10 holes next to where the scsi ribbon
>    cable connects to the adapatec card that look like they might be a
>    place to put the little termination resistors, but they're empty (and
>    they were before too).  I'm wondering f there's a termination issue
>    here, but I don't knoe enough about SCSI to know for sure. 
    
    The rule is that there must be termination on BOTH ENDS and NOWHERE
    ELSE.  [If there are drives on "both sides" of the Adaptec card, then
    there should be no termination "on" the card]
    
    First guess -- there USED to be a terminator plug on the "outside
    connector" of the 1740, and it isn't there now.
    
    Or -- you need termination resistors on the card.
    
    Or -- the cabling isn't really that way.
    
    Of course, I'm not fully familiar with the 1740 -- but on the Adaptec
    www site (obvious address), there are copies of "all of the
    documentation" for all of their cards (in PDF format as I recall).  Go
    get the doc. for the 1740, and check if it has "software termination"
    or "hardware termination".
    
    Advice: get an external terminator plug, attach it to the back of this
    thing, and see if it fixes the problem.


Note 350.3, 1-Apr-1997
Bowler: 
--------
>    I presume that's exactly the way it's cabled -- and presume that all of
>    the drives are INternal (hard to tell from your description).
    
    Yes on both accounts.
    
>    ELSE.  [If there are drives on "both sides" of the Adaptec card, then
>    there should be no termination "on" the card]
    
    All on "one side" of the card.
    
>    First guess -- there USED to be a terminator plug on the "outside
>    connector" of the 1740, and it isn't there now.


The DECUServe Journal  June, 1997                              Page 33
OSF/1 Disk Woes


    I haven't seen one "lying on the floor".
    
>    Or -- you need termination resistors on the card.
    
    Which leads to another question...  Are all termination resistors
    created equal or are some "more equal" than others?
    
>    Advice: get an external terminator plug, attach it to the back of this
>    thing, and see if it fixes the problem.
    
    I would, except it's got this "really weird" (i.e. non-standard)
    external plug, the likes of which I haven't seen before.  
    
    I'll check out their site tonight...


Note 350.4, 11-Apr-1997
Miller: maybe a dying disk?
---------------------------
    I had almost the exact same problem!
    The only way I could clear the "solid lit" drive was power off/on.
    Turns out I had a dying RZ26.  Use some SCU command like this:
    
    SCU> switch /dev/whatever
    SCU> show defects
    
    If the list is big and getting bigger, you probably need to replace the
    drive.
    

Note 350.5, 11-Apr-1997
Bowler: 
--------
    Despite the vendor saying "we tested that disk thoroughly before we
    sent it to you", that's exactly what it was...


  About the DECUServe Journal
  ---------------------------

  The DECUServe Story

  DECUServe is an electronic conferencing system, somewhat related to
  bulletin board systems but much larger and more organized.  It is
  devoted to the general area of computer technology such as systems,
  software, hardware, and communication, in the Digital and related
  third party vendor market area.


The DECUServe Journal  June, 1997                              Page 34
About the DECUServe Journal


  DECUServe also has complete access to and from the Internet.  Usenet
  Newsgroups are accessible using newsreaders from DECUServe and the
  comp.os.vms newsgroup is added to a VAX Notes conference of its own.

  The conferencing system is available nearly 24 hours a day, seven
  days per week.  There is no hourly connect charge.  The subscriber
  pays communication costs to a phone number in eastern Massachusetts.
  Reduced rate communication services are available in some areas and
  INTERNET access is available (node - eisner.decus.org).
  Subscriptions must be used by a single person.  Company or group
  subscriptions are not available, nor may subscriptions be
  transferred.

  DECUServe uses the Digital VAX Notes conferencing software.  We
  currently have over 50 technical conferences available on subjects
  such as Security, the VMS Operating System, ALL-IN-1, Databases,
  Site Management, Personal Computing, DEC Networking, Third Party
  Software, Hardware, Workstations, the World Wide Web and many more.
  Over 130,000 technical notes are on line.  All conferences,
  including the Frequently Asked Questions (FAQ) from the Usenet
  newgroups, are indexed to allow for fast text content searches.

  You can obtain up to the date statistics and information via the
  World Wide Web at the URL
  http://WWW.DECUS.ORG/decus/decusv/index.html which provides a number
  of options.  One option displays the activity in each of the
  technical conference.  Another option allows you to read issues of
  the DECUServe Journal which is published worldwide every month and
  contains samples of the discussions that occur 24 hours a day.

  If you have access to Internet mail, you can receive a DECUServe
  Application form directly.  Send mail to
  application@eisner.decus.org -- the mail text may be blank.  On-line
  subscription information is available in the U.S.  by dialing
  1-800-521-8950 and logging in with username INFORMATION.

  Publication Information

  Topic threads in the DEC Notes conferences on DECUServe are selected
  for publication on the basis of strong technical content and/or
  interest to a wide audience.  They are submitted to the editor from
  various sources, including DECUServe Moderators, Executive Committee
  members, and other volunteers.  Suggestions for inclusion are
  enthusiastically solicited.  Articles selected for publication are
  edited on an OpenVMS VAX system in TPU and then formatted with
  Digital Standard Runoff.


The DECUServe Journal  June, 1997                              Page 35
Contact Information


  Contact Information
  -------------------

  The editors of the DECUServe Journal are Brian and Sherrie McMahon.
  They can be reached by any of the following means:

       mcmahon_b@decuserve.decus.org
       mcmahon_s@decuserve.decus.org
       mcmahonb@decus.org
       griffith@decus.org
       bmcmahon@cisco.com
       Work phone:  +1 408 527 0434