What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????^?????}v?????^?????}vB 001111110011111100111111001111110011111101011110001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111010111100011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f5e3f3f3f3f3f7d763f3f3f3f3f5e3f3f3f3f3f7d7642
SJIS-WIN 蝴???賻^蝴???阜}v蝴???賻^蝴???阜}vB 1110010110011010001111110011111100111111111001101101000001011110111001011001101000111111001111110011111110010101100011000111110101110110111001011001101000111111001111110011111111100110110100000101111011100101100110100011111100111111001111111001010110001100011111010111011001000010 e59a3f3f3fe6d05ee59a3f3f3f958c7d76e59a3f3f3fe6d05ee59a3f3f3f958c7d7642
EUC-JP 蝴???賻^蝴???阜}v蝴???賻^蝴???阜}vB 1110100111111010001111110011111100111111111011001101001001011110111010011111101000111111001111110011111111001001111011000111110101110110111010011111101000111111001111110011111111101100110100100101111011101001111110100011111100111111001111111100100111101100011111010111011001000010 e9fa3f3f3fecd25ee9fa3f3f3fc9ec7d76e9fa3f3f3fecd25ee9fa3f3f3fc9ec7d7642
UTF-8 蝴렫롋꾀賻^蝴렫롋꾀阜}v蝴렫롋꾀賻^蝴렫롋꾀阜}vB 11101000100111011011010011101011101000001010101111101011101000011000101111101010101111101000000011101000101100111011101101011110111010001001110110110100111010111010000010101011111010111010000110001011111010101011111010000000111010011001100010011100011111010111011011101000100111011011010011101011101000001010101111101011101000011000101111101010101111101000000011101000101100111011101101011110111010001001110110110100111010111010000010101011111010111010000110001011111010101011111010000000111010011001100010011100011111010111011001000010 e89db4eba0abeba18beabe80e8b3bb5ee89db4eba0abeba18beabe80e9989c7d76e89db4eba0abeba18beabe80e8b3bb5ee89db4eba0abeba18beabe80e9989c7d7642
UHC 蝴렫롋꾀賻^蝴렫롋꾀阜}v蝴렫롋꾀賻^蝴렫롋꾀阜}vB 1111101111011101100011101011100110001110110100011011001011010010110111011011100001011110111110111101110110001110101110011000111011010001101100101101001011011101101111010111110101110110111110111101110110001110101110011000111011010001101100101101001011011101101110000101111011111011110111011000111010111001100011101101000110110010110100101101110110111101011111010111011001000010 fbdd8eb98ed1b2d2ddb85efbdd8eb98ed1b2d2ddbd7d76fbdd8eb98ed1b2d2ddb85efbdd8eb98ed1b2d2ddbd7d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
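
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's own implementation: the function name to_bit_strings is hypothetical, and the Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumed to correspond to the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "}vB"  # ASCII tail of the sample input shown in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Because "}", "v", and "B" are ASCII, every charset in the list encodes them to the same bytes (7d 76 42), which matches the tail of each row in the table. Multi-byte characters are where the charsets diverge.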