To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 也や??цぐ娃??渦????ぐ娃??沃?? 100101101110011110000010111000100011111100111111100001001000100010000010101011101000100010100001001111110011111110001001010100010011111100111111001111110011111110000010101011101000100010100001001111110011111110010111100000000011111100111111 96e782e23f3f848882ae88a13f3f89513f3f3f3f82ae88a13f3f97803f3f
EUC-JP 也や??цぐ娃??渦????ぐ娃??沃?? 110011001110100110100100111001000011111100111111101001111110100010100100101100001011000010100011001111110011111110110001101100100011111100111111001111110011111110100100101100001011000010100011001111110011111111001101111000000011111100111111 cce9a4e43f3fa7e8a4b0b0a33f3fb1b23f3f3f3fa4b0b0a33f3fcde03f3f
UTF-8 也や퓱歷цぐ娃쒍퍟渦욘뜆呂묋ぐ娃쒍퓱沃겻퍟 1110010010111001100111111110001110000010100001001110110110010011101100011110111110100110100011001101000110000110111000111000000110010000111001011010100010000011111011001001001010001101111011011000110110011111111001101011100010100110111011001001101010011000111010111001110010000110111011111010011010000000111010111010110010001011111000111000000110010000111001011010100010000011111011001001001010001101111011011001001110110001111001101011001010000011111010101011001010111011111011011000110110011111 e4b99fe38284ed93b1efa68cd186e38190e5a883ec928ded8d9fe6b8a6ec9a98eb9c86efa680ebac8be38190e5a883ec928ded93b1e6b283eab2bbed8d9f
UHC 也や퓱歷цぐ娃쒍퍟渦욘뜆呂묋ぐ娃쒍퓱沃겻퍟 111001011010010110101010111001001011111110010111111001101011100010101100111010001010101010110000111010001101111110011100111001001011101110010110111010001011111010111111111001101000110110001001111001011111101110010001111010001010101010110000111010001101111110011100111001001011111110010111111010001010101010110000111001001011101110010110 e5a5aae4bf97e6b8ace8aab0e8df9ce4bb96e8bebfe68d89e5fb91e8aab0e8df9ce4bf97e8aab0e4bb96

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)