To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????m 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6d
SJIS-WIN ???????????????鶯????????m 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111110100111110010001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fe9f23f3f3f3f3f3f3f3f6d
EUC-JP ???????????????鶯????????m 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111111001011110100001111110011111100111111001111110011111100111111001111110011111101101101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3ff2f43f3f3f3f3f3f3f3f6d
UTF-8 溜삳젘溜븍젘溜븍젎溜삳젔溜븍뿊鶯숇젎溜삳젾溜븍젻m 11101111101001111000101111101100100000101011001111101100101000001001100011101111101001111000101111101011101110001000110111101100101000001001100011101111101001111000101111101011101110001000110111101100101000001000111011101111101001111000101111101100100000101011001111101100101000001001010011101111101001111000101111101011101110001000110111101011101111111000101011101001101101101010111111101100100010001000011111101100101000001000111011101111101001111000101111101100100000101011001111101100101000001011111011101111101001111000101111101011101110001000110111101100101000001011101101101101 efa78bec82b3eca098efa78bebb88deca098efa78bebb88deca08eefa78bec82b3eca094efa78bebb88debbf8ae9b6afec8887eca08eefa78bec82b3eca0beefa78bebb88deca0bb6d
UHC 溜삳젘溜븍젘溜븍젎溜삳젔溜븍뿊鶯숇젎溜삳젾溜븍젻m 11101010111111101011101111101011101000001001010011101010111111101011101011101011101000001001010011101010111111101011101011101011101000001000111111101010111111101011101111101011101000001001001011101010111111101011101011101011100101111001000111100101101000111001100111101011101000001000111111101010111111101011101111101011101000001011000011101010111111101011101011101011101000001010111001101101 eafebbeba094eafebaeba094eafebaeba08feafebbeba092eafebaeb9791e5a399eba08feafebbeba0b0eafebaeba0ae6d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)