To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????M 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101001101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4d
SJIS-WIN 形??衡①?艶g?形??衡①?艶{》M 100011000110000000111111001111111000110101110100100001110100000000111111100010011001000010000010100001110011111110001100011000000011111100111111100011010111010010000111010000000011111110001001100100001000000101101111100000010111010001001101 8c603f3f8d7487403f899082873f8c603f3f8d7487403f8990816f81744d
EUC-JP 形??衡??艶g?形??衡??艶{》M 10110111110000010011111100111111101110011101010100111111001111111011000111110000101000111110011100111111101101111100000100111111001111111011100111010101001111110011111110110001111100001010000111010000101000011101010101001101 b7c13f3fb9d53f3fb1f0a3e73fb7c13f3fb9d53f3fb1f0a1d0a1d54d
UTF-8 形쀮맖衡①컮艶g왃形쀮맖衡①컮艶{》M 11100101101111011010001011101100100000001010111011101011101001111001011011101000101000011010000111100010100100011010000011101100101110111010111011101000100010011011011011101111101111011000011111101100100110011000001111100101101111011010001011101100100000001010111011101011101001111001011011101000101000011010000111100010100100011010000011101100101110111010111011101000100010011011011011101111101111011001101111100011100000001000101101001101 e5bda2ec80aeeba796e8a1a1e291a0ecbbaee889b6efbd87ec9983e5bda2ec80aeeba796e8a1a1e291a0ecbbaee889b6efbd9be3808b4d
UHC 形쀮맖衡①컮艶g왃形쀮맖衡①컮艶{》M 11111011101000011001011111101110100100001010100011111011101011001010100011100111101100001001010011100110111111011010001111100111100111101011011011111011101000011001011111101110100100001010100011111011101011001010100011100111101100001001010011100110111111011010001111111011101000011011011101001101 fba197ee90a8fbaca8e7b094e6fda3e79eb6fba197ee90a8fbaca8e7b094e6fda3fba1b74d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)