What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????F}v???????F}vB 001111110011111100111111001111110011111100111111001111110100011001111101011101100011111100111111001111110011111100111111001111110011111101000110011111010111011001000010 3f3f3f3f3f3f3f467d763f3f3f3f3f3f3f467d7642
SJIS-WIN 嶸疾弴フ骼ゥ自F}v嶸疾弴フ骼ゥ自F}vB 11111010101101001000111010111110111110101011100011001100111010011000111010101001100011101010100101000110011111010111011011111010101101001000111010111110111110101011100011001100111010011000111010101001100011101010100101000110011111010111011001000010 fab48ebefab8cce98ea98ea9467d76fab48ebefab8cce98ea98ea9467d7642
EUC-JP 嶸疾弴フ骼ゥ自F}v嶸疾弴フ骼ゥ自F}vB 100011111011101111110100101111001100000010001111101111001110110110001110110011001111000111101110100011101010100110111100101010110100011001111101011101101000111110111011111101001011110011000000100011111011110011101101100011101100110011110001111011101000111010101001101111001010101101000110011111010111011001000010 8fbbf4bcc08fbced8eccf1ee8ea9bcab467d768fbbf4bcc08fbced8eccf1ee8ea9bcab467d7642
UTF-8 嶸疾弴フ骼ゥ自F}v嶸疾弴フ骼ゥ自F}vB 11100101101101101011100011100111100101101011111011100101101111001011010011101111101111101000110011101001101010101011110011101111101111011010100111101000100001111010101001000110011111010111011011100101101101101011100011100111100101101011111011100101101111001011010011101111101111101000110011101001101010101011110011101111101111011010100111101000100001111010101001000110011111010111011001000010 e5b6b8e796bee5bcb4efbe8ce9aabcefbda9e887aa467d76e5b6b8e796bee5bcb4efbe8ce9aabcefbda9e887aa467d7642
UHC 嶸疾????自F}v嶸疾????自F}vB 111001111010111011110010111100000011111100111111001111110011111111101101101110110100011001111101011101101110011110101110111100101111000000111111001111110011111100111111111011011011101101000110011111010111011001000010 e7aef2f03f3f3f3fedbb467d76e7aef2f03f3f3f3fedbb467d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
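
The conversion shown in the table can be roughly reproduced with a few lines of Python. This is only a sketch, not the code behind this page: `encode_to_bits` is a hypothetical helper, and the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above. For some characters Python's codecs may produce different bytes than this tool does.

```python
# Hypothetical helper: encode text in a charset and render the resulting
# bytes as a bit string and a hex string.
def encode_to_bits(text, charset):
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    raw = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()

# Assumed correspondence between the table's labels and Python codec names
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits("自", codec)
        print(label, bits, hex_string)
```

Because ASCII characters such as "F}vB" encode to the same bytes (467d7642) in every charset listed, only the non-ASCII part of the input differs between rows.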