To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俉??節??晤??渦??穩??節??晤??鈺 111110100110000100111111001111111001000011011111001111110011111110011101111010110011111100111111100010010101000100111111001111111110001001110010001111110011111110010000110111110011111100111111100111011110101100111111001111111111101111000100 fa613f3f90df3f3f9deb3f3f89513f3fe2723f3f90df3f3f9deb3f3ffbc4
EUC-JP 俉??節??晤??渦??穩??節??晤??鈺 1000111110110001101110110011111100111111110000001110000100111111001111111101101011101101001111110011111110110001101100100011111100111111111000111101001100111111001111111100000011100001001111110011111111011010111011010011111100111111100011111110001111010101 8fb1bb3f3fc0e13f3fdaed3f3fb1b23f3fe3d33f3fc0e13f3fdaed3f3f8fe3d5
UTF-8 俉녑쪍節쏙쉼晤볩슘渦뤄슉穩뷸뎸節곤슴晤볩슘鈺 111001001011111110001001111010111000010110010001111011001010101010001101111001111010111110000000111011001000111110011001111011001000100110111100111001101001100110100100111010111011001110101001111011001000101010011000111001101011100010100110111010111010010010000100111011001000101010001001111001111010100110101001111010111011011110111000111010111000111010111000111001111010111110000000111010101011001110100100111011001000101010110100111001101001100110100100111010111011001110101001111011001000101010011000111010011000100010111010 e4bf89eb8591ecaa8de7af80ec8f99ec89bce699a4ebb3a9ec8a98e6b8a6eba484ec8a89e7a9a9ebb7b8eb8eb8e7af80eab3a4ec8ab4e699a4ebb3a9ec8a98e988ba
UHC 俉녑쪍節쏙쉼晤볩슘渦뤄슉穩뷸뎸節곤슴晤볩슘鈺 1110011111101011101100111110010110100101100001111110111110111101101111011110111110111101101100001110011111111011100100111110111110111101101101111110100010111110101101111110111110111101101101011110100010110001101110101110011010001001100010111110111110111101101100001110111110111101101111111110011111111011100100111110111110111101101101111110100010101101 e7ebb3e5a587efbdbdefbdb0e7fb93efbdb7e8beb7efbdb5e8b1bae6898befbdb0efbdbfe7fb93efbdb7e8ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)