To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???肛??辱????肛??辱?B 001111110011111100111111111000111110100000111111001111111001000001001010001111110011111100111111001111111110001111101000001111110011111110010000010010100011111101000010 3f3f3fe3e83f3f904a3f3f3f3fe3e83f3f904a3f42
EUC-JP ???肛??辱????肛??辱?B 001111110011111100111111111001101110101000111111001111111011111110101011001111110011111100111111001111111110011011101010001111110011111110111111101010110011111101000010 3f3f3fe6ea3f3fbfab3f3f3f3fe6ea3f3fbfab3f42
UTF-8 降싷슭肛됵슬辱쳚降싷슭肛됵슬辱쳚B 11101111101010001000100111101100100010111011011111101100100010101010110111101000100000101001101111101011100100001011010111101100100010101010110011101000101111101011000111101100101100111001101011101111101010001000100111101100100010111011011111101100100010101010110111101000100000101001101111101011100100001011010111101100100010101010110011101000101111101011000111101100101100111001101001000010 efa889ec8bb7ec8aade8829beb90b5ec8aace8beb1ecb39aefa889ec8bb7ec8aade8829beb90b5ec8aace8beb1ecb39a42
UHC 降싷슭肛됵슬辱쳚降싷슭肛됵슬辱쳚B 111110101010001010011010111011111011110110111110111110011111110110001001111011111011110110111101111010011011010010101011011110101111101010100010100110101110111110111101101111101111100111111101100010011110111110111101101111011110100110110100101010110111101001000010 faa29aefbdbef9fd89efbdbde9b4ab7afaa29aefbdbef9fd89efbdbde9b4ab7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)