Monday, October 22, 2012

NFS share issues with multiple ip on a host



NAS box has a share which is to be mounted on a client


NAS box: myNasBox
Client: SolZone1


NAS box has provided access to the ip/hostname of the host

(SolZone1:/)# nslookup SolZone1
Server:         10.10.134.3
Address:        10.10.134.3#53

Name:   SolZone1.bc
Address: 45.26.20.216

But the mounting failed with permission error

(SolZone1:/)# mount -F nfs -o rw,bg,soft,vers=3 myNasBox:/nasdump/nasdata /nasdata
nfs mount: myNasBox:/nasdump/nasdata: Permission denied


Issue identified as below

(SolZone1:/)# traceroute myNasBox
traceroute: Warning: Multiple interfaces found; using 10.12.12.12 @ aggr535001:3
traceroute to myNasBox (10.12.11.45), 30 hops max, 40 byte packets
 1  10.13.142.2 (10.13.142.2)  0.713 ms  0.710 ms  0.574 ms
 2  172.27.128.113 (172.27.128.113)  3.354 ms  3.056 ms  2.407 ms
 3  172.27.128.142 (172.27.128.142)  2.375 ms  8.416 ms  2.308 ms
 4  nas-box-14 (10.12.11.45)  1.406 ms  1.481 ms  1.427 ms
(SolZone1:/)#

The traffic comes through the ip 10.12.12.12

(SolZone1:/)# nslookup 10.12.12.12
Server:         10.10.134.3
Address:        10.10.134.3#53

12.142.120.10.in-addr.arpa      name = SolZone1-lb.bc.

Multiple ip's for the host

(SolZone1:/)# ifconfig -a
lo0:7: flags=2001000849 mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
aggr209001:1: flags=201000843 mtu 1500 index 4
        inet 45.16.20.26 netmask fffffe00 broadcast 45.216.205.255
aggr302001:3: flags=201000843 mtu 1500 index 5
        inet 45.29.1.8 netmask ffffff00 broadcast 45.219.1.255
aggr535001:3: flags=201000843 mtu 1500 index 6
        inet 10.12.12.12 netmask ffffff00 broadcast 10.13.142.255
nxge200006:1: flags=201000843 mtu 1500 index 7
        inet 10.28.18.165 netmask fffffe00 broadcast 10.48.129.255
(SolZone1:/)#


NAS is adapted to permit the ip 10.12.12.12.

Now the mount is successfull

(SolZone1:/)# mount -F nfs -o rw,bg,soft,vers=3 myNasBox:/nasdump/nasdata /nasdata

(SolZone1:/)# bdf /nasdata
Filesystem            kbytes    used   avail capacity  Mounted on
myNasBox:/nasdump/nasdata
                     4017992400 1317611268 2722172800    33%    /nasdata
(SolZone1:/)#

"kill -9" fails... why?


Kill -9 >> Process still alive !!!!


A signal is a message sent to a process to interrupt it and cause a response. If the process has been designed to respond to signals of the type sent it does so; otherwise, it terminates. There are 42 signals, as defined in the signal(5) man page. 


Why sometime process cannot be killed with kill -9?

If the process is hung waiting for I/O (or some other kernel resource) it will behave that way. The problem is the process is hung waiting for an I/O to return and is unable to service the signal. So it never gets killed. Usually reboot is the only solution for this condition.

Sometimes a process will be waiting for kernel to do some I/O and the kernel is waiting for a timeout because of a hardware error, etc. This can happen if you try to do a file copy for instance and a harddrive fails. The 'cp' command may not be killable until the kernel timeout has occurred (this may hang indefinately).

stale NFS mounts will cause the issue if hard mount is used. One way to avoid hung processes is to "soft" mount the NFS mounts. 


https://forums.oracle.com/thread/2398407

http://docs.oracle.com/cd/E19455-01/806-1360/processmanagerusing-62/index.html

Monday, October 15, 2012

HP-UX sw* operations


HP-UX Package install/uninstall/list operations


### Find the available depot on a host

in the local host :

**swlist -i -d @ /var/opt/pkgs/depots/Rel.11.31/apps

In a remote host :

root@MyHPHOST:/hroot# swlist -l depot -s MyHPHOST2 -d /var/opt/pkgs/depots/Rel.11.31/apps
# Initializing...
# Target "MyHPHOST2" has the following depot(s):
  /var/opt/pkgs/depots/Rel.11.23/core
  /var/opt/pkgs/depots/Rel.11.23/apps
  /var/opt/pkgs/depots/Rel.11.31/core
  /var/opt/pkgs/depots/Rel.11.31/apps
  /var/opt/pkgs/depots/Rel.11.11/core
  /var/opt/pkgs/depots/Rel.11.11/apps
  /var/spool/sw
root@MyHPHOST:/hroot#


### Find a specific Package in the depot

root@MyHPHOST:/hroot# swlist -d -l product NMSAP @ MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps
# Initializing...
# Contacting target "MyHPHOST2"...
#
# Target:  MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps
#

  NMSAP         4.0            NetWorker Module for SAP with Oracle
root@MyHPHOST:/hroot#


root@MyHPHOST:/hroot# swlist -d -l product @ MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps | grep NMSAP
  NMSAP                 4.0            NetWorker Module for SAP with Oracle
root@MyHPHOST:/hroot#



### Find the list of files in a specific package

root@MyHPHOST:/hroot# swlist -d -l file NMSAP @ MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps
# Initializing...
# Contacting target "MyHPHOST2"...
#
# Target:  MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps
#

# NMSAP                 4.0            NetWorker Module for SAP with Oracle
# NMSAP.lgto-nmsap      4.0            NetWorker Module for SAP with Oracle
  /etc/init.utl
  /etc/nsrsapsv.cfg
  /etc/nwsldreg.xml
  /opt/networker/bin/backint
  /opt/networker/bin/nsrsapadm
  /opt/networker/bin/nsrsapprobe
  /opt/networker/bin/nsrsapra
  /opt/networker/bin/nsrsaprachild
  /opt/networker/bin/nsrsapsv
  /opt/networker/bin/sapclone
  /opt/networker/bin/sapra.jar
  /opt/networker/bin/sapra_res.jar
  /opt/networker/lib/apps/nmsap/hpux64/libcommonssl.7.6.dev.139.so
  /opt/networker/man/man8/backint.8
  /opt/networker/man/man8/nsrsapadm.8
  /opt/networker/man/man8/nsrsapprobe.8
  /opt/networker/man/man8/nsrsapsv.8
  /opt/networker/man/man8/sapclone.8
  /usr/lib/libnworasap.so
root@MyHPHOST:/hroot#


### Remove a package

root@MyHPHOST:/hroot# swlist -l product | grep NMSAP
  NMSAP                 3.5            EMC NetWorker Module for SAP with Oracle
root@MyHPHOST:/hroot#
root@MyHPHOST:/hroot# swremove NMSAP

=======  08/21/13 14:26:51 METDST  BEGIN swremove SESSION
         (non-interactive) (jobid=MyHPHOST-0432)

       * Session started for user "root@MyHPHOST".

       * Beginning Selection
       * Target connection succeeded for "MyHPHOST:/".
       * Software selections:
             NMSAP.lgto-nmsap,l=/,r=3.5,a=HP-UX_11.XX_64
       * Selection succeeded.


       * Beginning Analysis
       * Session selections have been saved in the file
         "/hroot/.sw/sessions/swremove.last".
       * The analysis phase succeeded for "MyHPHOST:/".
       * Analysis succeeded.


       * Beginning Execution
       * The execution phase succeeded for "MyHPHOST:/".
       * Execution succeeded.


NOTE:    More information may be found in the agent logfile using the
         command "swjob -a log MyHPHOST-0432 @ MyHPHOST:/".

=======  08/21/13 14:27:11 METDST  END swremove SESSION
         (non-interactive) (jobid=MyHPHOST-0432)

root@MyHPHOST:/hroot# swlist -l product | grep NMSAP
root@MyHPHOST:/hroot#



### Install a package



root@MyHPHOST:/hroot# swinstall -s MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps NMSAP

=======  08/21/13 14:30:47 METDST  BEGIN swinstall SESSION
         (non-interactive) (jobid=MyHPHOST-0433)

       * Session started for user "root@MyHPHOST".

       * Beginning Selection
       * Target connection succeeded for "MyHPHOST:/".
       * Source connection succeeded for
         "MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps".
       * Source:
         MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps
       * Targets:                MyHPHOST:/
       * Software selections:
             NMSAP.lgto-nmsap,r=4.0,a=HP-UX_11.XX_64
           + NetWorker.nwr-cbin,r=7.6.3,a=HP-UX_11.XX_64
       * A "+" indicates an automatic selection due to dependency or
         the automatic selection of a patch or reference bundle.
       * Selection succeeded.


       * Beginning Analysis and Execution
       * Session selections have been saved in the file
         "/hroot/.sw/sessions/swinstall.last".
       * "MyHPHOST:/":  1 filesets have a version with a higher
         revision number already installed.
ERROR:   "MyHPHOST:/":  The software dependencies for 1 products or
         filesets cannot be resolved.
       * The analysis phase failed for "MyHPHOST:/".
       * Analysis and Execution had errors.


ERROR:   More information may be found in the agent logfile using the
         command "swjob -a log MyHPHOST-0433 @ MyHPHOST:/".

=======  08/21/13 14:31:03 METDST  END swinstall SESSION
         (non-interactive) (jobid=MyHPHOST-0433)

root@MyHPHOST:/hroot# 

*** Failed becasue of dependencies. To overcomes this, evaluate the dependency condition and exclude it if possible


root@MyHPHOST:/hroot# swinstall -s MyHPHOST2:/var/opt/pkgs/depots/Rel.11.31/apps -x enforce_dependencies=false NMSAP





root@MyHPHOST:/hroot# swlist -l product | grep NMSAP
  NMSAP                 4.0            NetWorker Module for SAP with Oracle
root@MyHPHOST:/hroot#



Wednesday, October 10, 2012

FTP site command



UMASK for ftp can be set globally by adding the proper value for umask in the file /etc/ftpd/ftpaccess


But how can umask be used from a client system to be executed on the remote server instead of the global setting?

To send some commands from client to server where it can be executed.

For eg) To set UMASK for a user on the remote server, use 


ftp> site UMASK



SITE command can be used for a number of other operations as well as seen in the ftp help menu



ftp> site help

214-The following SITE commands are recognized (* =>'s unimplemented).
   UMASK           GROUP           INDEX           GROUPS
   IDLE            GPASS           EXEC            CHECKMETHOD
   CHMOD           NEWER           ALIAS           CHECKSUM
   HELP            MINFO           CDPATH
214 Direct comments to ftp-bugs@solaris1.
ftp>


Friday, September 28, 2012

Pluggable Authentication Module (PAM)


PAM allows various authentication technologies such as Kerberos, RSA, smart cards and DCE into system entry services such as login, passwd, rlogin, telnet, ftp, and su without changing any of these services.

PAM configuration file pam.conf is used to manage how these services use PAM framework. 

How it works?

Application Services (rlogin, telnet, ssh etc...) => PAM API (Refers pam.conf) => PAM Modules to manage the access


The entries in /etc/pam.conf or in PAM policy files are in one of the following formats:

"service-name module-type control-flag module-path module-options"


eg) LDAP configuration in pam.conf.

more /etc/pam.conf | grep ldap

zone03:/root# more /etc/pam.conf | grep ldap
login   auth required           pam_ldap.so.1 use_first_pass
rlogin  auth required           pam_ldap.so.1 use_first_pass
rsh     auth required           pam_ldap.so.1 use_first_pass
ppp     auth required           pam_ldap.so.1 use_first_pass
other   auth required           pam_ldap.so.1 use_first_pass
passwd  auth required           pam_ldap.so.1 use_first_pass

User Creation issues and some checks


(zone01:/)# useradd -u 890 -g 52 -d /twsp/tws -c "Master user" -s /bin/ksh Master
UX: useradd: ERROR: group 582 does not exist.  Choose another.

(zone01:/)# groupadd -g 52 twspgr

(zone01:/)# useradd -u 890 -g 52 -d /twsp/tws -c "Master user" -s /bin/ksh Master
UX: useradd: ERROR: Inconsistent password files.  See pwconv(1M).
(zone01:/)#
(zone01:/)# pwconv
(zone01:/)#
(zone01:/)# useradd -u 890 -g 52 -d /twsp/tws -c "Master user" -s /bin/ksh Master
UX: useradd: ERROR: Inconsistent password files.  See pwconv(1M).

(zone01:/)# cat /etc/passwd |grep Master

(zone01:/)# useradd -u 890 -g 52 -d /twsp/tws -c "Master user" -s /bin/ksh Master
UX: useradd: ERROR: Inconsistent password files.  See pwconv(1M).

(zone01:/)#
(zone01:/)# wc -l /etc/passwd /etc/shadow
      27 /etc/passwd
      27 /etc/shadow
      54 total


(zone01:/)# pwck

uucp:x:5:5:uucp Admin:/usr/lib/uucp:
        Login directory not found

nuucp:x:9:9:uucp Admin:/var/spool/uucppublic:/usr/lib/uucp/uucico
        Login directory not found
        Optional shell file not found
(zone01:/)#
(zone01:/)#
(zone01:/)# ls -ld /usr/lib/uucp
/usr/lib/uucp: No such file or directory

(zone01:/)# grpck


utlitygr::360:cdpuck,cdpucka,cdpucks,cdpuckm
        Group name too long
(zone01:/)#
(zone01:/)#
(zone01:/)# ls /var/spool/uucppublic
/var/spool/uucppublic: No such file or directory
(zone01:/)# ls /usr/lib/uucp/uucico
/usr/lib/uucp/uucico: No such file or directory
(zone01:/)#

Wednesday, September 19, 2012

hacf -cftocmd to recreate the commands from the main.cf file


The whole VCS config can be recreated by recreating the commands from the main.cf file.


Location of config file - /etc/VRTSvcs/conf/config


dump the exsisting config details in the memory to the disk.


(Solaris1:/root)# cd /etc/VRTSvcs/conf/config
(Solaris1:/etc/VRTSvcs/conf/config)# haconf -dump
(Solaris1:/etc/VRTSvcs/conf/config)#


now create the command file from the main.cf file using the hacf command

(Solaris1:/etc/VRTSvcs/conf/config)# hacf
VCS INFO V-16-1-10601 Usage:
         hacf -cftocmd   [-dest ] [-display]
         hacf -cmdtocf   [-dest ]
         hacf -verify     [-display]
         hacf -help

(Solaris1:/etc/VRTSvcs/conf/config)#


This command will create the command file whioch can be used to recreate the whole cluster.

    hacf -cftocmd   [-dest ] [-display]


(Solaris1:/etc/VRTSvcs/conf/config)# hacf -cftocmd . -dest .


(Solaris1:/etc/VRTSvcs/conf/config)# ls -ltr | tail
-rw-------   1 root     root       29075 Nov 13 14:31 main.cf.13Nov2012.14.31.22
-rw-------   1 root     root       29362 Nov 26 12:32 main.cf.26Nov2012.12.32.09
-rw-------   1 root     root       30286 Nov 27 11:35 main.cf.27Nov2012.11.35.18
-rw-------   1 root     root       30649 Dec 24 10:27 main.cf.24Dec2012.10.27.42
-rw-------   1 root     root       30722 Dec 24 10:29 main.cf.24Dec2012.10.29.13
-rw-------   2 root     root       33096 Feb 25 14:34 main.cf.previous
-rw-------   2 root     root       33096 Feb 25 14:34 main.cf.25Feb2013.14.34.29
-rw-------   2 root     root       33099 Mar 26 11:22 main.cf.26Mar2013.11.22.57
-rw-------   2 root     root       33099 Mar 26 11:22 main.cf
-rw-------   1 root     root      192090 Mar 26 11:23 main.cmd
(Solaris1:/etc/VRTSvcs/conf/config)# 


(Solaris1:/etc/VRTSvcs/conf/config)# more main.cmd
haclus -modify  PrintMsg 0
haclus -modify  UserNames  alex bIGmJLhRJfIReHFgFI bibon ejjEjpInjEgkFg sarco JlmKliLilFljKslNltKglFmh
KulS indigo dKLjKFjQKiKSjHKmKF naglel aHHoHCgIHjHQhEEbED idone cpoLpyOxpJpvLkmKmmKn z_zone1
_Solaris1 chhPilGqhOedDdeAed
haclus -modify  ClusterAddress "10.20.02.180"
haclus -modify  Administrators  alex bibon sarco indigo naglel idone
haclus -modify  SourceFile "./main.cf"
haclus -modify  ClusterName vcs-cluster1
hatype -add ASG
hatype -modify ASMDG SourceFile "./OracleMTypes.cf"
hatype -modify ASMDG AgentDirectory "/opt/VRTSagents/ha/bin/ASG"
hatype -modify ASMDG ArgList Sid Owner Home DBAUser DBAPword DiskGroups EnvFile Encoding
hatype -modify ASMDG ContainerOpts RunInContainer 1 PassCInfo 0
haattr -add ASG Home -string
haattr -add ASG Owner -string

Thursday, September 13, 2012

recoveryoption parameter setting in VXVM


Changing VxVM parameter in order to improve the reaction of the Solaris x86 servers in case of SAN instabilities.  

Veritas level configuration changes to be done on the hosts Solaris1 and  Solaris1. 

There is no impact as it is an online activity 


(Solaris1:/root)#
(Solaris1:/root)# vxdmpadm getattr enclosure hp_p95000
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_p95000      iopolicy                      MinimumQ       MinimumQ
hp_p95000      partitionsize                 512            512
hp_p95000      use_all_paths                 -              -
hp_p95000      failover_policy               Global         Global
hp_p95000      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[300]
hp_p95000      redundancy                    0              0
hp_p95000      dmp_lun_retry_timeout         0              0
hp_p95000      failovermode                  -              -


(Solaris1:/root)# vxdmpadm listenclosure all
ENCLR_NAME        ENCLR_TYPE     ENCLR_SNO      STATUS       ARRAY_TYPE     LUN_COUNT
=======================================================================================
disk              Disk           DISKS                CONNECTED    Disk        4
hp_xp24k5         HP_XP24K       28494                CONNECTED    A/A        199
hp_p95000         HP_P9500       85753                CONNECTED    A/A        127
(Solaris1:/root)#


(Solaris1:/root)# vxdmpadm getattr enclosure hp_p95000
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_p95000      iopolicy                      MinimumQ       MinimumQ
hp_p95000      partitionsize                 512            512
hp_p95000      use_all_paths                 -              -
hp_p95000      failover_policy               Global         Global
hp_p95000      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[300]
hp_p95000      redundancy                    0              0
hp_p95000      dmp_lun_retry_timeout         0              0
hp_p95000      failovermode                  -              -
(Solaris1:/root)#


(Solaris1:/root)#
(Solaris1:/root)# vxdmpadm getattr enclosure hp_xp24k50
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_xp24k50      iopolicy                      MinimumQ       MinimumQ
hp_xp24k50      partitionsize                 512            512
hp_xp24k50      use_all_paths                 -              -
hp_xp24k50      failover_policy               Global         Global
hp_xp24k50      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_xp24k50      recoveryoption[errorretry]    Timebound[300] Timebound[300]
hp_xp24k50      redundancy                    0              0
hp_xp24k50      dmp_lun_retry_timeout         0              0
hp_xp24k50      failovermode                  -              -
(Solaris1:/root)#


(Solaris1:/root)#
(Solaris1:/root)# vxdmpadm setattr enclosure hp_p95000 recoveryoption=timebound iotimeout=320
(Solaris1:/root)#


(Solaris1:/root)# vxdmpadm getattr enclosure hp_xp24k50
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_xp24k50      iopolicy                      MinimumQ       MinimumQ
hp_xp24k50      partitionsize                 512            512
hp_xp24k50      use_all_paths                 -              -
hp_xp24k50      failover_policy               Global         Global
hp_xp24k50      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_xp24k50      recoveryoption[errorretry]    Timebound[300] Timebound[300]
hp_xp24k50      redundancy                    0              0
hp_xp24k50      dmp_lun_retry_timeout         0              0
hp_xp24k50      failovermode                  -              -
(Solaris1:/root)# vxdmpadm getattr enclosure hp_p95000
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_p95000      iopolicy                      MinimumQ       MinimumQ
hp_p95000      partitionsize                 512            512
hp_p95000      use_all_paths                 -              -
hp_p95000      failover_policy               Global         Global
hp_p95000      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[320]
hp_p95000      redundancy                    0              0
hp_p95000      dmp_lun_retry_timeout         0              0
hp_p95000      failovermode                  -              -
(Solaris1:/root)#


(Solaris1:/root)#
(Solaris1:/root)# vxdmpadm setattr enclosure hp_xp24k50 recoveryoption=timebound iotimeout=320
(Solaris1:/root)# recoveryoption=timebound iotimeout=320
(Solaris1:/root)#

(Solaris1:/root)#
(Solaris1:/root)# vxdmpadm getattr enclosure hp_xp24k50
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_xp24k50      iopolicy                      MinimumQ       MinimumQ
hp_xp24k50      partitionsize                 512            512
hp_xp24k50      use_all_paths                 -              -
hp_xp24k50      failover_policy               Global         Global
hp_xp24k50      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_xp24k50      recoveryoption[errorretry]    Timebound[300] Timebound[320]
hp_xp24k50      redundancy                    0              0
hp_xp24k50      dmp_lun_retry_timeout         0              0
hp_xp24k50      failovermode                  -              -
(Solaris1:/root)# vxdmpadm getattr enclosure hp_p95000
ENCLR_NAME      ATTR_NAME                     DEFAULT        CURRENT
============================================================================
hp_p95000      iopolicy                      MinimumQ       MinimumQ
hp_p95000      partitionsize                 512            512
hp_p95000      use_all_paths                 -              -
hp_p95000      failover_policy               Global         Global
hp_p95000      recoveryoption[throttle]      Nothrottle[0]  Nothrottle[0]
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[320]
hp_p95000      redundancy                    0              0
hp_p95000      dmp_lun_retry_timeout         0              0
hp_p95000      failovermode                  -              -
(Solaris1:/root)#


(Solaris1:/root)# vxdmpadm getattr enclosure hp_xp24k50 | grep Timebound
hp_xp24k50      recoveryoption[errorretry]    Timebound[300] Timebound[320]

(Solaris1:/root)# vxdmpadm getattr enclosure hp_p95000 | grep Timebound
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[320]
(Solaris1:/root)#

(Solaris1:/root)# vxdmpadm getattr enclosure hp_xp24k50 | grep Timebound; vxdmpadm getattr enclosure hp_p95000 | grep Timebound
hp_xp24k50      recoveryoption[errorretry]    Timebound[300] Timebound[320]
hp_p95000      recoveryoption[errorretry]    Timebound[300] Timebound[320]
(Solaris1:/root)#

Monday, September 10, 2012

Resize a FS with space from a specific disk


In a disk group, where some volumes are stripped and some are concat, while adding disk space to a concat volume, care needs to be taken.


For STRIPE volumes, when adding a new disk, the whole stripe has to be relayedout.

Solaris1:/root# vxprint -htg DG-DBDG | grep -i STRIP
pl VOL-DBDG-ARCH01-02 VOL-DBDG-ARCH01 ENABLED ACTIVE 1822425088 STRIPE 11/8192 RW
pl VOL-DBDG-DTA01-02 VOL-DBDG-DTA01 ENABLED ACTIVE 678232064 STRIPE 8/8192 RW
pl VOL-DBDG-DTA04-02 VOL-DBDG-DTA04 ENABLED ACTIVE 3091324928 STRIPE 19/8192 RW
pl VOL-DBDG-DTA05-02 VOL-DBDG-DTA05 ENABLED ACTIVE 2910044160 STRIPE 18/8192 RW
pl VOL-DBDG-DTA06-02 VOL-DBDG-DTA06 ENABLED ACTIVE 2910044160 STRIPE 18/8192 RW
pl VOL-DBDG-DTA07-02 VOL-DBDG-DTA07 ENABLED ACTIVE 2910044160 STRIPE 18/8192 RW
pl VOL-DBDG-ID01-02 VOL-DBDG-ID01 ENABLED ACTIVE 671121408 STRIPE 12/8192 RW
pl VOL-DBDG-ID04-02 VOL-DBDG-ID04 ENABLED ACTIVE 1343619072 STRIPE 16/8192 RW
pl VOL-DBDG-ID05-02 VOL-DBDG-ID05 ENABLED ACTIVE 1121992704 STRIPE 14/8192 RW
pl VOL-DBDG-ID06-02 VOL-DBDG-ID06 ENABLED ACTIVE 1121992704 STRIPE 14/8192 RW
pl VOL-DBDG-ID07-02 VOL-DBDG-ID07 ENABLED ACTIVE 1121992704 STRIPE 14/8192 RW
pl VOL-DBDG-MIDO01-02 VOL-DBDG-MIDO01 ENABLED ACTIVE 62914560 STRIPE 8/8192 RW
pl VOL-DBDG-RBS01-02 VOL-DBDG-RBS01 ENABLED ACTIVE 381681664 STRIPE 13/8192 RW
pl VOL-DBDG-RDO01-02 VOL-DBDG-RDO01 ENABLED ACTIVE 62914560 STRIPE 8/8192 RW
pl VOL-DBDG-TMP01-02 VOL-DBDG-TMP01 ENABLED ACTIVE 1006632960 STRIPE 12/8192 RW


For CONCAT volumes, it is straight extension but the disk that was added needs to be used only for this volume and shouldn't mix with the STRIPE volumes.

Solaris1:/root# vxprint -htg DG-DBDG | grep -i CONCAT
pl VOL-DBDG-bkp01-02 VOL-DBDG-bkp01 ENABLED ACTIVE 63955712 CONCAT - RW
pl VOL-DBDG-ptr-02 VOL-DBDG-ptr ENABLED ACTIVE 4194304 CONCAT - RW
pl VOL-DBDG-mon-02 VOL-DBDG-mon ENABLED ACTIVE 2097152 CONCAT -    RW
Solaris1:/root#


Solaris1:/root# df -h /DG-DBDG/bkp01
Filesystem             size   used  avail capacity  Mounted on
/dev/vx/dsk/DG-DBDG/VOL-DBDG-bkp01
                        13G   4.6G   8.3G    36%    /DG-DBDG/bkp01
Solaris1:/root#
Solaris1:/root#


Solaris1:/root# vxprint -htg DG-DBDG -v VOL-DBDG-bkp01
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO
EX NAME         ASSOC        VC                       PERMS    MODE     STATE

v  VOL-DBDG-bkp01 -    ENABLED  ACTIVE   28304128 SELECT    -        fsgen
pl VOL-DBDG-bkp01-02 VOL-DBDG-bkp01 ENABLED ACTIVE 28304128 CONCAT - RW
sd DSK-DBDG-2B04-01 VOL-DBDG-bkp01-02 DSK-DBDG-2B4 0 28304128 0 hp_p95000_2b4 ENA
Solaris1:/root#
Solaris1:/root#


Solaris1:/root# /etc/vx/bin/vxdisksetup -i hp_p95000_2f9

Solaris1:/root# vxdisk -o alldgs -e list | grep -i 2f9
hp_p95000_2f9 auto:cdsdisk   -            -           online thinrclm      c2t50060E80164EF966d80s2 tprclm fc tc-pvol pvol
Solaris1:/root#

Solaris1:/root#
Solaris1:/root# vxdg -g DG-DBDG adddisk DSK-DBDG-2F9=hp_p95000_2f9
Solaris1:/root#


Solaris1:/root# /etc/vx/bin/vxresize -g DG-DBDG VOL-DBDG-bkp01 +17g DSK-DBDG-2F9
Solaris1:/root#


Solaris1:/root# df -h /DG-DBDG/bkp01
Filesystem             size   used  avail capacity  Mounted on
/dev/vx/dsk/DG-DBDG/VOL-DBDG-bkp01
                        30G   4.6G    24G    17%    /DG-DBDG/bkp01
Solaris1:/root#



Solaris1:/root# vxprint -htg DG-DBDG -v VOL-DBDG-bkp01
V  NAME         RVG/VSET/CO  KSTATE   STATE    LENGTH   READPOL   PREFPLEX UTYPE
PL NAME         VOLUME       KSTATE   STATE    LENGTH   LAYOUT    NCOL/WID MODE
SD NAME         PLEX         DISK     DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
SV NAME         PLEX         VOLNAME  NVOLLAYR LENGTH   [COL/]OFF AM/NM    MODE
SC NAME         PLEX         CACHE    DISKOFFS LENGTH   [COL/]OFF DEVICE   MODE
DC NAME         PARENTVOL    LOGVOL
SP NAME         SNAPVOL      DCO
EX NAME         ASSOC        VC                       PERMS    MODE     STATE

v  VOL-DBDG-bkp01 -    ENABLED  ACTIVE   63955712 SELECT    -        fsgen
pl VOL-DBDG-bkp01-02 VOL-DBDG-bkp01 ENABLED ACTIVE 63955712 CONCAT - RW
sd DSK-DBDG-2B04-01 VOL-DBDG-bkp01-02 DSK-DBDG-2B4 0 28304128 0 hp_p95000_2b4 ENA
sd DSK-DBDG-2F98-01 VOL-DBDG-bkp01-02 DSK-DBDG-2F9 0 35651584 28304128 hp_p95000_2f9 ENA
Solaris1:/root#

Friday, August 31, 2012

Failsafe Mode in zone


Failsafe Mode in a Zone

This is used to recover a damaged zone when the normal login doesn't work

(SolarisPhy:/)# zlogin SolZone
[Connected to zone 'SolZone' pts/11]
Login incorrect

[Connection to zone 'SolZone' pts/11 closed]
(SolarisPhy:/)#

Use -s to login as failsafe mode

(SolarisPhy:/)# zlogin -S SolZone
[Connected to zone 'SolZone' pts/11]
# hostname
SolZone