To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 臟捧??紗可???U}臟捧??紗可???U{^ 11100100011001101001010111111001001111110011111110001110110100011000100111000010001111110011111100111111010101010111110111100100011001101001010111111001001111110011111110001110110100011000100111000010001111110011111100111111010101010111101101011110 e46695f93f3f8ed189c23f3f3f557de46695f93f3f8ed189c23f3f3f557b5e
EUC-JP 臟捧??紗可獐??U}臟捧??紗可獐??U{^ 1110011111000111110010101111101100111111001111111011110011010011101100101100010010001111110010111011101000111111001111110101010101111101111001111100011111001010111110110011111100111111101111001101001110110010110001001000111111001011101110100011111100111111010101010111101101011110 e7c7cafb3f3fbcd3b2c48fcbba3f3f557de7c7cafb3f3fbcd3b2c48fcbba3f3f557b5e
UTF-8 臟捧뱄綎紗可獐쇤맛U}臟捧뱄綎紗可獐쇤맛U{^ 1110100010000111100111111110011010001101101001111110101110110001100001001110011110110110100011101110011110110100100101111110010110001111101011111110011110001101100100001110110010000111101001001110101110100111100110110101010101111101111010001000011110011111111001101000110110100111111010111011000110000100111001111011011010001110111001111011010010010111111001011000111110101111111001111000110110010000111011001000011110100100111010111010011110011011010101010111101101011110 e8879fe68da7ebb184e7b68ee7b497e58fafe78d90ec87a4eba79b557de8879fe68da7ebb184e7b68ee7b497e58fafe78d90ec87a4eba79b557b5e
UHC 臟捧뱄綎紗可獐쇤맛U}臟捧뱄綎紗可獐쇤맛U{^ 1110110111110100110111001110100110111001111011111110111111110010110111101110100111001010101001101110110111101111101111001110100110111000110000000101010101111101111011011111010011011100111010011011100111101111111011111111001011011110111010011100101010100110111011011110111110111100111010011011100011000000010101010111101101011110 edf4dce9b9efeff2dee9caa6edefbce9b8c0557dedf4dce9b9efeff2dee9caa6edefbce9b8c0557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)