To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???猿??筍リ?B 00111111001111110011111110001001100011100011111100111111111000101010000110000011100010100011111101000010 3f3f3f898e3f3fe2a1838a3f42
EUC-JP ???猿??筍リ?B 00111111001111110011111110110001111011100011111100111111111001001010001110100101111010100011111101000010 3f3f3fb1ee3f3fe4a3a5ea3f42
UTF-8 蓮곥렖猿섊뇖筍リ퍡B 11101111101001101001100111101010101100111010010111101011101000001001011011100111100011001011111111101100100001001000101011101011100001111001011011100111101011011000110111100011100000111010101011101101100011011010000101000010 efa699eab3a5eba096e78cbfec848aeb8796e7ad8de383aaed8da142
UHC 蓮곥렖猿섊뇖筍リ퍡B 11100110111001011000000111100011100011101010101111101010101110111001100011100111100001111000000111100010111011001010101111101010101110111001100001000010 e6e581e38eabeabb98e78781e2ecabeabb9842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)