To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8???音??壤??癲?8???音??壤 111000011001111100111111100000100101011100111111001111110011111110001001101110010011111100111111100110101101111100111111001111111110000110011111001111111000001001010111001111110011111100111111100010011011100100111111001111111001101011011111 e19f3f82573f3f3f89b93f3f9adf3f3fe19f3f82573f3f3f89b93f3f9adf
EUC-JP 癲?8???音??壤??癲?8???音??壤 111000101010000100111111101000111011100000111111001111110011111110110010101110110011111100111111110101001110000100111111001111111110001010100001001111111010001110111000001111110011111100111111101100101011101100111111001111111101010011100001 e2a13fa3b83f3f3fb2bb3f3fd4e13f3fe2a13fa3b83f3f3fb2bb3f3fd4e1
UTF-8 癲쒕8杻잏뵳音뚰맩壤쏇듁癲쒕8杻잏뵳音뚰맩壤 111001111001100110110010111011001001001010010101111011111011110010011000111011111010011110001000111011001001111010001111111010111011010110110011111010011001111110110011111010111001101010110000111010111010011110101001111001011010001110100100111011001000111110000111111010111001001110000001111001111001100110110010111011001001001010010101111011111011110010011000111011111010011110001000111011001001111010001111111010111011010110110011111010011001111110110011111010111001101010110000111010111010011110101001111001011010001110100100 e799b2ec9295efbc98efa788ec9e8febb5b3e99fb3eb9ab0eba7a9e5a3a4ec8f87eb9381e799b2ec9295efbc98efa788ec9e8febb5b3e99fb3eb9ab0eba7a9e5a3a4
UHC 癲쒕8杻잏뵳音뚰맩壤쏇듁癲쒕8杻잏뵳音뚰맩壤 1110111110100110100111001110101110100011101110001110101011110100100111111110011110010100101100011110101111100101100011001110110110010000101100011110010110111101100110111110110110001010101101101110111110100110100111001110101110100011101110001110101011110100100111111110011110010100101100011110101111100101100011001110110110010000101100011110010110111101 efa69ceba3b8eaf49fe794b1ebe58ced90b1e5bd9bed8ab6efa69ceba3b8eaf49fe794b1ebe58ced90b1e5bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)