To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 豼頑ソ穫R豼頑ソ穫^[豼頑ソ穫R豼頑ソ穫^[^ 1110011010111111100010101110011010111111100010100110111001010010111001101011111110001010111001101011111110001010011011100101111001011011111001101011111110001010111001101011111110001010011011100101001011100110101111111000101011100110101111111000101001101110010111100101101101011110 e6bf8ae6bf8a6e52e6bf8ae6bf8a6e5e5be6bf8ae6bf8a6e52e6bf8ae6bf8a6e5e5b5e
EUC-JP 豼頑ソ穫R豼頑ソ穫^[豼頑ソ穫R豼頑ソ穫^[^ 111011001100000110110100111010001000111010111111101100111100111101010010111011001100000110110100111010001000111010111111101100111100111101011110010110111110110011000001101101001110100010001110101111111011001111001111010100101110110011000001101101001110100010001110101111111011001111001111010111100101101101011110 ecc1b4e88ebfb3cf52ecc1b4e88ebfb3cf5e5becc1b4e88ebfb3cf52ecc1b4e88ebfb3cf5e5b5e
UTF-8 豼頑ソ穫R豼頑ソ穫^[豼頑ソ穫R豼頑ソ穫^[^ 11101000101100011011110011101001101000001001000111101111101111011011111111100111101010011010101101010010111010001011000110111100111010011010000010010001111011111011110110111111111001111010100110101011010111100101101111101000101100011011110011101001101000001001000111101111101111011011111111100111101010011010101101010010111010001011000110111100111010011010000010010001111011111011110110111111111001111010100110101011010111100101101101011110 e8b1bce9a091efbdbfe7a9ab52e8b1bce9a091efbdbfe7a9ab5e5be8b1bce9a091efbdbfe7a9ab52e8b1bce9a091efbdbfe7a9ab5e5b5e
UHC ?頑?穫R?頑?穫^[?頑?穫R?頑?穫^[^ 00111111111010001101011100111111111111001010111001010010001111111110100011010111001111111111110010101110010111100101101100111111111010001101011100111111111111001010111001010010001111111110100011010111001111111111110010101110010111100101101101011110 3fe8d73ffcae523fe8d73ffcae5e5b3fe8d73ffcae523fe8d73ffcae5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)