To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 癲??依??淫??n}癲??依??淫??n{^ 1110000110011111001111110011111110001000110010110011111100111111100010001111101000111111001111110110111001111101111000011001111100111111001111111000100011001011001111110011111110001000111110100011111100111111011011100111101101011110 e19f3f3f88cb3f3f88fa3f3f6e7de19f3f3f88cb3f3f88fa3f3f6e7b5e
EUC-JP 癲??依??淫??n}癲??依??淫??n{^ 1110001010100001001111110011111110110000110011010011111100111111101100001111110000111111001111110110111001111101111000101010000100111111001111111011000011001101001111110011111110110000111111000011111100111111011011100111101101011110 e2a13f3fb0cd3f3fb0fc3f3f6e7de2a13f3fb0cd3f3fb0fc3f3f6e7b5e
UTF-8 癲븐룇依루랜淫륁삂n}癲븐룇依루랜淫륁삂n{^ 1110011110011001101100101110101110111000100100001110101110100011100001111110010010111110100111011110101110100011101010001110101110011110100111001110011010110111101010111110101110100101100000011110110010000010100000100110111001111101111001111001100110110010111010111011100010010000111010111010001110000111111001001011111010011101111010111010001110101000111010111001111010011100111001101011011110101011111010111010010110000001111011001000001010000010011011100111101101011110 e799b2ebb890eba387e4be9deba3a8eb9e9ce6b7abeba581ec82826e7de799b2ebb890eba387e4be9deba3a8eb9e9ce6b7abeba581ec82826e7b5e
UHC 癲븐룇依루랜淫륁삂n}癲븐룇依루랜淫륁삂n{^ 1110111110100110101110101110110010001111100001101110101111101110101101111110011110110111101000111110101111100010100011111110110010011000100010010110111001111101111011111010011010111010111011001000111110000110111010111110111010110111111001111011011110100011111010111110001010001111111011001001100010001001011011100111101101011110 efa6baec8f86ebeeb7e7b7a3ebe28fec98896e7defa6baec8f86ebeeb7e7b7a3ebe28fec98896e7b5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
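
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the codec names (`cp932` for SJIS-Win, `euc_jp`, `cp949` for UHC) are assumed to match the charsets listed above, and `to_bit_strings` is a hypothetical helper name. It assumes that characters a charset cannot represent are replaced by "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters that the codec cannot represent are replaced by "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "n{^"  # the ASCII tail of the input shown in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Because all five charsets are ASCII-compatible, the sample "n{^" yields the same bytes (6e 7b 5e) in every row; the rows differ only for characters outside ASCII.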