To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN 也ゅ???ぐ娃??也ゅ???ぐ娃??}B 10010110111001111000001011100011001111110011111100111111100000101010111010001000101000010011111100111111100101101110011110000010111000110011111100111111001111111000001010101110100010001010000100111111001111110111110101000010 96e782e33f3f3f82ae88a13f3f96e782e33f3f3f82ae88a13f3f7d42
EUC-JP 也ゅ???ぐ娃??也ゅ???ぐ娃??}B 11001100111010011010010011100101001111110011111100111111101001001011000010110000101000110011111100111111110011001110100110100100111001010011111100111111001111111010010010110000101100001010001100111111001111110111110101000010 cce9a4e53f3f3fa4b0b0a33f3fcce9a4e53f3f3fa4b0b0a33f3f7d42
UTF-8 也ゅ뜵呂묋ぐ娃쒎콪也ゅ뜵呂묋ぐ娃쒏릍}B 1110010010111001100111111110001110000010100001011110101110011100101101011110111110100110100000001110101110101100100010111110001110000001100100001110010110101000100000111110110010010010100011101110110010111101101010101110010010111001100111111110001110000010100001011110101110011100101101011110111110100110100000001110101110101100100010111110001110000001100100001110010110101000100000111110110010010010100011111110101110100110100011010111110101000010 e4b99fe38285eb9cb5efa680ebac8be38190e5a883ec928eecbdaae4b99fe38285eb9cb5efa680ebac8be38190e5a883ec928feba68d7d42
UHC 也ゅ뜵呂묋ぐ娃쒎콪也ゅ뜵呂묋ぐ娃쒏릍}B 1110010110100101101010101110010110001101101100111110010111111011100100011110100010101010101100001110100011011111100111001110010110110001100111101110010110100101101010101110010110001101101100111110010111111011100100011110100010101010101100001110100011011111100111001110011010111000101011000111110101000010 e5a5aae58db3e5fb91e8aab0e8df9ce5b19ee5a5aae58db3e5fb91e8aab0e8df9ce6b8ac7d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)