To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疫??疫??疫??狎??歪ゆ?弱??癌??軟 10001001011101010011111100111111100010010111010100111111001111111000100101110101001111110011111111100000101111100011111100111111100110000110001110000010111001000011111110001110111000110011111100111111100010101110000000111111001111111001001111101110 89753f3f89753f3f89753f3fe0be3f3f986382e43f8ee33f3f8ae03f3f93ee
EUC-JP 疫??疫??疫??狎??歪ゆ?弱??癌??軟 10110001110101100011111100111111101100011101011000111111001111111011000111010110001111110011111111100000110000000011111100111111110011111100010010100100111001100011111110111100111001010011111100111111101101001110001000111111001111111100011011110000 b1d63f3fb1d63f3fb1d63f3fe0c03f3fcfc4a4e63fbce53f3fb4e23f3fc6f0
UTF-8 疫욥궕疫욕컙疫욜땸狎숁뮈歪ゆ뮈弱꾣뮈癌꿩뮈軟 111001111001011010101011111011001001101010100101111010101011011010010101111001111001011010101011111011001001101010010101111011001011101110011001111001111001011010101011111011001001101010011100111010111001010110111000111001111000101110001110111011001000100010000001111010111010111010001000111001101010110110101010111000111000001010000110111010111010111010001000111001011011110010110001111010101011111010100011111010111010111010001000111001111001100110001100111010101011111110101001111010111010111010001000111010001011101110011111 e796abec9aa5eab695e796abec9a95ecbb99e796abec9a9ceb95b8e78b8eec8881ebae88e6adaae38286ebae88e5bcb1eabea3ebae88e7998ceabfa9ebae88e8bb9f
UHC 疫욥궕疫욕컙疫욜땸狎숁뮈歪ゆ뮈弱꾣뮈癌꿩뮈軟 1110011010111001101111111110100110000010101010101110011010111001101111111110010110110000100001001110011010111001101111111110011110001011100011101110010011100100100110011110011010111001101111111110100011100000101010101110011010111001101111111110010110110000100001001110011010111001101111111110010011011111101100101110011010111001101111111110011011100011 e6b9bfe982aae6b9bfe5b084e6b9bfe78b8ee4e499e6b9bfe8e0aae6b9bfe5b084e6b9bfe4dfb2e6b9bfe6e3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)