What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????|????|[????|????|[^ 0011111100111111001111110011111101111100001111110011111100111111001111110111110001011011001111110011111100111111001111110111110000111111001111110011111100111111011111000101101101011110 3f3f3f3f7c3f3f3f3f7c5b3f3f3f3f7c3f3f3f3f7c5b5e
SJIS-WIN 鱆軸炅漆|鱆軸炅漆|[鱆軸炅漆|鱆軸炅漆|[^ 111010011110000110001110101100101111101101010001100011101011110101111100111010011110000110001110101100101111101101010001100011101011110101111100010110111110100111100001100011101011001011111011010100011000111010111101011111001110100111100001100011101011001011111011010100011000111010111101011111000101101101011110 e9e18eb2fb518ebd7ce9e18eb2fb518ebd7c5be9e18eb2fb518ebd7ce9e18eb2fb518ebd7c5b5e
EUC-JP 鱆軸炅漆|鱆軸炅漆|[鱆軸炅漆|鱆軸炅漆|[^ 11110010111000111011110010110100100011111100100111001010101111001011111101111100111100101110001110111100101101001000111111001001110010101011110010111111011111000101101111110010111000111011110010110100100011111100100111001010101111001011111101111100111100101110001110111100101101001000111111001001110010101011110010111111011111000101101101011110 f2e3bcb48fc9cabcbf7cf2e3bcb48fc9cabcbf7c5bf2e3bcb48fc9cabcbf7cf2e3bcb48fc9cabcbf7c5b5e
UTF-8 鱆軸炅漆|鱆軸炅漆|[鱆軸炅漆|鱆軸炅漆|[^ 11101001101100011000011011101000101110111011100011100111100000101000010111100110101111001000011001111100111010011011000110000110111010001011101110111000111001111000001010000101111001101011110010000110011111000101101111101001101100011000011011101000101110111011100011100111100000101000010111100110101111001000011001111100111010011011000110000110111010001011101110111000111001111000001010000101111001101011110010000110011111000101101101011110 e9b186e8bbb8e78285e6bc867ce9b186e8bbb8e78285e6bc867c5be9b186e8bbb8e78285e6bc867ce9b186e8bbb8e78285e6bc867c5b5e
UHC ?軸炅漆|?軸炅漆|[?軸炅漆|?軸炅漆|[^ 0011111111110101111011101100110011011101111101101101010001111100001111111111010111101110110011001101110111110110110101000111110001011011001111111111010111101110110011001101110111110110110101000111110000111111111101011110111011001100110111011111011011010100011111000101101101011110 3ff5eeccddf6d47c3ff5eeccddf6d47c5b3ff5eeccddf6d47c3ff5eeccddf6d47c5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
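
A conversion like the one in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's, and matching them to the labels used here is an assumption. The helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?`, which is how the ISO-8859-1 and UHC rows above appear to behave.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (bytes, bit string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return raw, bits, raw.hex()


def encode_table(text):
    """Build one (label, displayed text, bits, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        raw, bits, hex_string = encode_row(text, codec)
        # Decode again so the "Character" column shows what survived.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, hex_string))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hex_string in encode_table("漆|[^"):
        print(label, shown, bits, hex_string, sep="\t")
```

Individual codecs can differ between implementations for vendor-extension characters, so some rows may not match the table above byte for byte.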