To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 豎梦ッ」妺シ諱ゎ椽豎梦ッ」妺シ諱ゎ椽^ 11100110101100011001101011101011101011111010001111111010101001011011110011100110100000011000001011101100100111101011101111100110101100011001101011101011101011111010001111111010101001011011110011100110100000011000001011101100100111101011101101011110 e6b19aebafa3faa5bce68182ec9ebbe6b19aebafa3faa5bce68182ec9ebb5e
EUC-JP 豎梦ッ」妺シ諱ゎ椽豎梦ッ」妺シ諱ゎ椽^ 111011001011001111010100111011011000111010101111100011101010001110001111101110011011011110001110101111001110101111100001101001001110111011011100101111011110110010110011110101001110110110001110101011111000111010100011100011111011100110110111100011101011110011101011111000011010010011101110110111001011110101011110 ecb3d4ed8eaf8ea38fb9b78ebcebe1a4eedcbdecb3d4ed8eaf8ea38fb9b78ebcebe1a4eedcbd5e
UTF-8 豎梦ッ」妺シ諱ゎ椽豎梦ッ」妺シ諱ゎ椽^ 11101000101100011000111011100110101000101010011011101111101111011010111111101111101111011010001111100101101001101011101011101111101111011011110011101000101010111011000111100011100000101000111011100110101001001011110111101000101100011000111011100110101000101010011011101111101111011010111111101111101111011010001111100101101001101011101011101111101111011011110011101000101010111011000111100011100000101000111011100110101001001011110101011110 e8b18ee6a2a6efbdafefbda3e5a6baefbdbce8abb1e3828ee6a4bde8b18ee6a2a6efbdafefbda3e5a6baefbdbce8abb1e3828ee6a4bd5e
UHC ??????諱ゎ椽??????諱ゎ椽^ 00111111001111110011111100111111001111110011111111111101110010011010101011101110111001101100101100111111001111110011111100111111001111110011111111111101110010011010101011101110111001101100101101011110 3f3f3f3f3f3ffdc9aaeee6cb3f3f3f3f3f3ffdc9aaeee6cb5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)