To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????z??????zB 001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f7a3f3f3f3f3f3f7a42
SJIS-WIN ?冀あ?訥ぢz?冀あ?訥ぢzB 0011111110011001011000101000001010100000001111111110011001100011100000101100000001111010001111111001100101100010100000101010000000111111111001100110001110000010110000000111101001000010 3f996282a03fe66382c07a3f996282a03fe66382c07a42
EUC-JP ?冀あ?訥ぢz?冀あ?訥ぢzB 0011111111010001110000111010010010100010001111111110101111000100101001001100001001111010001111111101000111000011101001001010001000111111111010111100010010100100110000100111101001000010 3fd1c3a4a23febc4a4c27a3fd1c3a4a23febc4a4c27a42
UTF-8 룵冀あ룶訥ぢz룵冀あ룶訥ぢzB 111010111010001110110101111001011000011010000000111000111000000110000010111010111010001110110110111010001010100010100101111000111000000110100010011110101110101110100011101101011110010110000110100000001110001110000001100000101110101110100011101101101110100010101000101001011110001110000001101000100111101001000010 eba3b5e58680e38182eba3b6e8a8a5e381a27aeba3b5e58680e38182eba3b6e8a8a5e381a27a42
UHC 룵冀あ룶訥ぢz룵冀あ룶訥ぢzB 100011111010101011010000111011011010101010100010100011111010101111010010111011011010101011000010011110101000111110101010110100001110110110101010101000101000111110101011110100101110110110101010110000100111101001000010 8faad0edaaa28fabd2edaac27a8faad0edaaa28fabd2edaac27a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)