To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 厭??蘊ゆ?猥??熱??辱??疫??疫??B 100010010111110100111111001111111110010101011101100000101110010000111111111000001100111000111111001111111001010001001101001111110011111110010000010010100011111100111111100010010111010100111111001111111000100101110101001111110011111101000010 897d3f3fe55d82e43fe0ce3f3f944d3f3f904a3f3f89753f3f89753f3f42
EUC-JP 厭??蘊ゆ?猥??熱??辱??疫??疫??B 101100011101111000111111001111111110100110111110101001001110011000111111111000001101000000111111001111111100011110101110001111110011111110111111101010110011111100111111101100011101011000111111001111111011000111010110001111110011111101000010 b1de3f3fe9bea4e63fe0d03f3fc7ae3f3fbfab3f3fb1d63f3fb1d63f3f42
UTF-8 厭얗큳蘊ゆ뮈猥덃뮈熱썹뼻辱껇나疫욥뇠疫욜뮵B 11100101100011101010110111101100100101101001011111101101100000011011001111101000100110001000101011100011100000101000011011101011101011101000100011100111100011001010010111101011100011011000001111101011101011101000100011100111100001101011000111101100100011011011100111101011101111001011101111101000101111101011000111101010101110111000011111101011100000101001100011100111100101101010101111101100100110101010010111101011100001111010000011100111100101101010101111101100100110101001110011101011101011101011010101000010 e58eadec9697ed81b3e8988ae38286ebae88e78ca5eb8d83ebae88e786b1ec8db9ebbcbbe8beb1eabb87eb8298e796abec9aa5eb87a0e796abec9a9cebaeb542
UHC 厭얗큳蘊ゆ뮈猥덃뮈熱썹뼻辱껇나疫욥뇠疫욜뮵B 11100110111101001011111011101001101101001000001111101000101100111010101011100110101110011011111111101000111001011000100011100110101110011011111111100110111100001011110111100111100101101011111011101001101101001000001111101000101100111010101011100110101110011011111111101001100001111000100011100110101110011011111111100111100100101011110101000010 e6f4bee9b483e8b3aae6b9bfe8e588e6b9bfe6f0bde796bee9b483e8b3aae6b9bfe98788e6b9bfe792bd42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)