To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 臟捧???臧????[臟捧???臧????[^ 1110010001100110100101011111100100111111001111110011111111100100011010000011111100111111001111110011111101011011111001000110011010010101111110010011111100111111001111111110010001101000001111110011111100111111001111110101101101011110 e46695f93f3f3fe4683f3f3f3f5be46695f93f3f3fe4683f3f3f3f5b5e
EUC-JP 臟捧?玎?臧?獐??[臟捧?玎?臧?獐??[^ 11100111110001111100101011111011001111111000111111001011110100100011111111100111110010010011111110001111110010111011101000111111001111110101101111100111110001111100101011111011001111111000111111001011110100100011111111100111110010010011111110001111110010111011101000111111001111110101101101011110 e7c7cafb3f8fcbd23fe7c93f8fcbba3f3f5be7c7cafb3f8fcbd23fe7c93f8fcbba3f3f5b5e
UTF-8 臟捧뱄玎렕臧렎獐쇨톼[臟捧뱄玎렕臧렎獐쇨톼[^ 111010001000011110011111111001101000110110100111111010111011000110000100111001111000111010001110111010111010000010010101111010001000011110100111111010111010000010001110111001111000110110010000111011001000011110101000111011011000011010111100010110111110100010000111100111111110011010001101101001111110101110110001100001001110011110001110100011101110101110100000100101011110100010000111101001111110101110100000100011101110011110001101100100001110110010000111101010001110110110000110101111000101101101011110 e8879fe68da7ebb184e78e8eeba095e887a7eba08ee78d90ec87a8ed86bc5be8879fe68da7ebb184e78e8eeba095e887a7eba08ee78d90ec87a8ed86bc5b5e
UHC 臟捧뱄玎렕臧렎獐쇨톼[臟捧뱄玎렕臧렎獐쇨톼[^ 11101101111101001101110011101001101110011110111111101111111010011000111010101010111011011111010110001110101001001110110111101111101111001110101011000101111011010101101111101101111101001101110011101001101110011110111111101111111010011000111010101010111011011111010110001110101001001110110111101111101111001110101011000101111011010101101101011110 edf4dce9b9efefe98eaaedf58ea4edefbceac5ed5bedf4dce9b9efefe98eaaedf58ea4edefbceac5ed5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)