Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 沃????ぐ娃??沃????ぐ娃??沃? 100101111000000000111111001111110011111100111111100000101010111010001000101000010011111100111111100101111000000000111111001111110011111100111111100000101010111010001000101000010011111100111111100101111000000000111111 97803f3f3f3f82ae88a13f3f97803f3f3f3f82ae88a13f3f97803f
EUC-JP 沃????ぐ娃??沃????ぐ娃??沃? 110011011110000000111111001111110011111100111111101001001011000010110000101000110011111100111111110011011110000000111111001111110011111100111111101001001011000010110000101000110011111100111111110011011110000000111111 cde03f3f3f3fa4b0b0a33f3fcde03f3f3f3fa4b0b0a33f3fcde03f
UTF-8 沃곈걶呂묋ぐ娃쒍퍟沃곈걶呂묋ぐ娃쒒걶沃곈 111001101011001010000011111010101011001110001000111010101011000110110110111011111010011010000000111010111010110010001011111000111000000110010000111001011010100010000011111011001001001010001101111011011000110110011111111001101011001010000011111010101011001110001000111010101011000110110110111011111010011010000000111010111010110010001011111000111000000110010000111001011010100010000011111011001001001010010010111010101011000110110110111001101011001010000011111010101011001110001000 e6b283eab388eab1b6efa680ebac8be38190e5a883ec928ded8d9fe6b283eab388eab1b6efa680ebac8be38190e5a883ec9292eab1b6e6b283eab388
UHC 沃곈걶呂묋ぐ娃쒍퍟沃곈걶呂묋ぐ娃쒒걶沃곈 11101000101010101011000011101001100000011001110011100101111110111001000111101000101010101011000011101000110111111001110011100100101110111001011011101000101010101011000011101001100000011001110011100101111110111001000111101000101010101011000011101000110111111001110011101001100000011001110011101000101010101011000011101001 e8aab0e9819ce5fb91e8aab0e8df9ce4bb96e8aab0e9819ce5fb91e8aab0e8df9ce9819ce8aab0e9

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
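
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table suggest. The names `CHARSETS` and `encode_table` are hypothetical and used only for illustration.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.

# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text):
    """Return a list of (charset, binary bit string, hex string) rows."""
    rows = []
    for name, codec in CHARSETS:
        # errors="replace" substitutes "?" for unencodable characters
        # (an assumption about how the page handles them).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("ぐ"):
        print(name, bits, hexa)
```

For example, `encode_table("ぐ")` yields `e38190` for UTF-8, `82ae` for SJIS-WIN and `a4b0` for EUC-JP, matching the byte sequences for that character in the table, while ISO-8859-1 falls back to `3f`.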