To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN 沃??墺?ぐ娃??沃??秧?ぐ娃??}B 10010111100000000011111100111111100110101101001000111111100000101010111010001000101000010011111100111111100101111000000000111111001111111110001001011110001111111000001010101110100010001010000100111111001111110111110101000010 97803f3f9ad23f82ae88a13f3f97803f3fe25e3f82ae88a13f3f7d42
EUC-JP 沃??墺?ぐ娃??沃??秧?ぐ娃??}B 11001101111000000011111100111111110101001101010000111111101001001011000010110000101000110011111100111111110011011110000000111111001111111110001110111111001111111010010010110000101100001010001100111111001111110111110101000010 cde03f3fd4d43fa4b0b0a33f3fcde03f3fe3bf3fa4b0b0a33f3f7d42
UTF-8 沃곈걶墺듣ぐ娃쒍퓱沃곈걶秧녘ぐ娃쒑큹}B 1110011010110010100000111110101010110011100010001110101010110001101101101110010110100010101110101110101110010011101000111110001110000001100100001110010110101000100000111110110010010010100011011110110110010011101100011110011010110010100000111110101010110011100010001110101010110001101101101110011110100111101001111110101110000101100110001110001110000001100100001110010110101000100000111110110010010010100100011110110110000001101110010111110101000010 e6b283eab388eab1b6e5a2baeb93a3e38190e5a883ec928ded93b1e6b283eab388eab1b6e7a7a7eb8598e38190e5a883ec9291ed81b97d42
UHC 沃곈걶墺듣ぐ娃쒍퓱沃곈걶秧녘ぐ娃쒑큹}B 1110100010101010101100001110100110000001100111001110011111110010101101011110100010101010101100001110100011011111100111001110010010111111100101111110100010101010101100001110100110000001100111001110010011101011101100111110100010101010101100001110100011011111100111001110100010110100100010000111110101000010 e8aab0e9819ce7f2b5e8aab0e8df9ce4bf97e8aab0e9819ce4ebb3e8aab0e8df9ce8b4887d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)