Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 辱⑥?膺??懿??B 1001000001001010100001110100010100111111111001000101111000111111001111111001110011110010001111110011111101000010 904a87453fe45e3f3f9cf23f3f42
EUC-JP 辱??膺??懿??B 10111111101010110011111100111111111001111011111100111111001111111101100011110100001111110011111101000010 bfab3f3fe7bf3f3fd8f43f3f42
UTF-8 辱⑥빖膺볣뇽懿몄럳B 11101000101111101011000111100010100100011010010111101011101110011001011011101000100001101011101011101011101100111010001111101011100001111011110111100110100001111011111111101011101010101000010011101011100111111011001101000010 e8beb1e291a5ebb996e886baebb3a3eb87bde687bfebaa84eb9fb342
UHC 辱⑥빖膺볣뇽懿몄럳B 11101001101101001010100011101100100101011011100011101011111011001001001111101001101101001010100011101011111100111011100011101100100011101001001101000010 e9b4a8ec95b8ebec93e9b4a8ebf3b8ec8e9342

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
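
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the converter's own implementation: the helper name to_bit_strings is hypothetical, and the Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumed to correspond to the charsets listed above, so individual mappings may differ from the tool's. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is why runs of 3f appear in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for each character the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


if __name__ == "__main__":
    sample = "辱B"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.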