To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 蜿ゥ蜿ゥ謗「陲芳}v蜿ゥ蜿ゥ謗「陲芳}vB 11100101100011111010100111100101100011111010100111100110100011101010001011101000101000101001011001000110011111010111011011100101100011111010100111100101100011111010100111100110100011101010001011101000101000101001011001000110011111010111011001000010 e58fa9e58fa9e68ea2e8a296467d76e58fa9e58fa9e68ea2e8a296467d7642
EUC-JP 蜿ゥ蜿ゥ謗「陲芳}v蜿ゥ蜿ゥ謗「陲芳}vB 11101001111011111000111010101001111010011110111110001110101010011110101111101110100011101010001011110000101001001100101110100111011111010111011011101001111011111000111010101001111010011110111110001110101010011110101111101110100011101010001011110000101001001100101110100111011111010111011001000010 e9ef8ea9e9ef8ea9ebee8ea2f0a4cba77d76e9ef8ea9e9ef8ea9ebee8ea2f0a4cba77d7642
UTF-8 蜿ゥ蜿ゥ謗「陲芳}v蜿ゥ蜿ゥ謗「陲芳}vB 1110100010011100101111111110111110111101101010011110100010011100101111111110111110111101101010011110100010101100100101111110111110111101101000101110100110011001101100101110100010001010101100110111110101110110111010001001110010111111111011111011110110101001111010001001110010111111111011111011110110101001111010001010110010010111111011111011110110100010111010011001100110110010111010001000101010110011011111010111011001000010 e89cbfefbda9e89cbfefbda9e8ac97efbda2e999b2e88ab37d76e89cbfefbda9e89cbfefbda9e8ac97efbda2e999b2e88ab37d7642
UHC ????謗??芳}v????謗??芳}vB 00111111001111110011111100111111110110111011111100111111001111111101101110111011011111010111011000111111001111110011111100111111110110111011111100111111001111111101101110111011011111010111011001000010 3f3f3f3fdbbf3f3fdbbb7d763f3f3f3fdbbf3f3fdbbb7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)