To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 辱??蚣辱???N}辱??蚣辱???N{^ 100100000100101000111111001111111110010101101110100100000100101000111111001111110011111101001110011111011001000001001010001111110011111111100101011011101001000001001010001111110011111100111111010011100111101101011110 904a3f3fe56e904a3f3f3f4e7d904a3f3fe56e904a3f3f3f4e7b5e
EUC-JP 辱??蚣辱??珙N}辱??蚣辱??珙N{^ 10111111101010110011111100111111111010011100111110111111101010110011111100111111100011111100101111110001010011100111110110111111101010110011111100111111111010011100111110111111101010110011111100111111100011111100101111110001010011100111101101011110 bfab3f3fe9cfbfab3f3f8fcbf14e7dbfab3f3fe9cfbfab3f3f8fcbf14e7b5e
UTF-8 辱잓죴蚣辱잓죴珙N}辱잓죴蚣辱잓죴珙N{^ 1110100010111110101100011110110010011110100100111110110010100011101101001110100010011010101000111110100010111110101100011110110010011110100100111110110010100011101101001110011110001111100110010100111001111101111010001011111010110001111011001001111010010011111011001010001110110100111010001001101010100011111010001011111010110001111011001001111010010011111011001010001110110100111001111000111110011001010011100111101101011110 e8beb1ec9e93eca3b4e89aa3e8beb1ec9e93eca3b4e78f994e7de8beb1ec9e93eca3b4e89aa3e8beb1ec9e93eca3b4e78f994e7b5e
UHC 辱잓죴蚣辱잓죴珙N}辱잓죴蚣辱잓죴珙N{^ 11101001101101001001111111101001101000011000111111001101111101111110100110110100100111111110100110100001100011111100110111110101010011100111110111101001101101001001111111101001101000011000111111001101111101111110100110110100100111111110100110100001100011111100110111110101010011100111101101011110 e9b49fe9a18fcdf7e9b49fe9a18fcdf54e7de9b49fe9a18fcdf7e9b49fe9a18fcdf54e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)