To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????D????D^ | 0011111100111111001111110011111101000100001111110011111100111111001111110100010001011110 | 3f3f3f3f443f3f3f3f445e |
| SJIS-WIN | 陜呻スェD陜呻スェD^ | 111010001001110110011001111011111011110110101010010001001110100010011101100110011110111110111101101010100100010001011110 | e89d99efbdaa44e89d99efbdaa445e |
| EUC-JP | 陜呻スェD陜呻スェD^ | 11101111111111011101001011110001100011101011110110001110101010100100010011101111111111011101001011110001100011101011110110001110101010100100010001011110 | effdd2f18ebd8eaa44effdd2f18ebd8eaa445e |
| UTF-8 | 陜呻スェD陜呻スェD^ | 111010011001100110011100111001011001000110111011111011111011110110111101111011111011110110101010010001001110100110011001100111001110010110010001101110111110111110111101101111011110111110111101101010100100010001011110 | e9999ce591bbefbdbdefbdaa44e9999ce591bbefbdbdefbdaa445e |
| UHC | 陜呻??D陜呻??D^ | 111110011111000011100011111000100011111100111111010001001111100111110000111000111110001000111111001111110100010001011110 | f9f0e3e23f3f44f9f0e3e23f3f445e |

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
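
The page does not say how it performs the conversion. As a rough illustration only, the same kind of result can be reproduced with the minimal Python sketch below. It assumes that Python's codec names `cp932`, `euc_jp`, and `cp949` correspond to SJIS-Win, EUC-JP, and UHC; these codecs may differ from the page's own tables for some characters. The helper name `to_bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes shown for ISO-8859-1 and UHC.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "D^"  # the two ASCII characters at the end of the input above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

ASCII characters such as `D` and `^` encode to the same bytes (`44`, `5e`) in all five charsets, which is why every row in the table ends in `445e`.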