To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN セュ鴆蒔社セャ蒔セォ鴫セュ鴆蒔社セャ蒔セャ鄙 1011111010101101111010011110111110001110101010101000111011010000101111101010110010001110101010101011111010101011100011101011000010111110101011011110100111101111100011101010101010001110110100001011111010101100100011101010101010111110101011001110011110111111 beade9ef8eaa8ed0beac8eaabeab8eb0beade9ef8eaa8ed0beac8eaabeace7bf
EUC-JP セュ鴆蒔社セャ蒔セォ鴫セュ鴆蒔社セャ蒔セャ鄙 1000111010111110100011101010110111110010111100011011110010101100101111001101001010001110101111101000111010101100101111001010110010001110101111101000111010101011101111001011001010001110101111101000111010101101111100101111000110111100101011001011110011010010100011101011111010001110101011001011110010101100100011101011111010001110101011001110111011000001 8ebe8eadf2f1bcacbcd28ebe8eacbcac8ebe8eabbcb28ebe8eadf2f1bcacbcd28ebe8eacbcac8ebe8eaceec1
UTF-8 セュ鴆蒔社セャ蒔セォ鴫セュ鴆蒔社セャ蒔セャ鄙 111011111011110110111110111011111011110110101101111010011011010010000110111010001001001010010100111001111010010010111110111011111011110110111110111011111011110110101100111010001001001010010100111011111011110110111110111011111011110110101011111010011011010010101011111011111011110110111110111011111011110110101101111010011011010010000110111010001001001010010100111001111010010010111110111011111011110110111110111011111011110110101100111010001001001010010100111011111011110110111110111011111011110110101100111010011000010010011001 efbdbeefbdade9b486e89294e7a4beefbdbeefbdace89294efbdbeefbdabe9b4abefbdbeefbdade9b486e89294e7a4beefbdbeefbdace89294efbdbeefbdace98499
UHC ???蒔社??蒔??????蒔社??蒔??鄙 0011111100111111001111111110001111001000110111101110010000111111001111111110001111001000001111110011111100111111001111110011111100111111111000111100100011011110111001000011111100111111111000111100100000111111001111111101111010101001 3f3f3fe3c8dee43f3fe3c83f3f3f3f3f3fe3c8dee43f3fe3c83f3fdea9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)