To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 荳ケ隱ー襠醍カ哥}v荳ケ隱ー襠醍カ哥}vB 11100100101110001011100111101000101010101011000011100101111110111001000111100111101101101001101001000110011111010111011011100100101110001011100111101000101010101011000011100101111110111001000111100111101101101001101001000110011111010111011001000010 e4b8b9e8aab0e5fb91e7b69a467d76e4b8b9e8aab0e5fb91e7b69a467d7642
EUC-JP 荳ケ隱ー襠醍カ哥}v荳ケ隱ー襠醍カ哥}vB 11101000101110101000111010111001111100001010110010001110101100001110101011111101110000101110100110001110101101101101001110100111011111010111011011101000101110101000111010111001111100001010110010001110101100001110101011111101110000101110100110001110101101101101001110100111011111010111011001000010 e8ba8eb9f0ac8eb0eafdc2e98eb6d3a77d76e8ba8eb9f0ac8eb0eafdc2e98eb6d3a77d7642
UTF-8 荳ケ隱ー襠醍カ哥}v荳ケ隱ー襠醍カ哥}vB 1110100010001101101100111110111110111101101110011110100110011010101100011110111110111101101100001110100010100101101000001110100110000110100011011110111110111101101101101110010110010011101001010111110101110110111010001000110110110011111011111011110110111001111010011001101010110001111011111011110110110000111010001010010110100000111010011000011010001101111011111011110110110110111001011001001110100101011111010111011001000010 e88db3efbdb9e99ab1efbdb0e8a5a0e9868defbdb6e593a57d76e88db3efbdb9e99ab1efbdb0e8a5a0e9868defbdb6e593a57d7642
UHC 荳?隱??醍?哥}v荳?隱??醍?哥}vB 1101010011100101001111111110101111011111001111110011111111110000101101010011111111001010101010000111110101110110110101001110010100111111111010111101111100111111001111111111000010110101001111111100101010101000011111010111011001000010 d4e53febdf3f3ff0b53fcaa87d76d4e53febdf3f3ff0b53fcaa87d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)