What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??揖у?兪???????Ⅴ揄?????飮 1110101001011111001111110011111110010111010010111000010010000101001111111001100101100000001111110011111100111111001111110011111100111111001111111000011101011000100111011000100100111111001111110011111100111111001111111001111101011010 ea5f3f3f974b84853f99603f3f3f3f3f3f3f87589d893f3f3f3f3f9f5a
EUC-JP 鸚??揖у?兪????????揄?????飮 11110011110000000011111100111111110011011010110010100111111001010011111111010001110000010011111100111111001111110011111100111111001111110011111100111111110110011110100100111111001111110011111100111111001111111101110110111011 f3c03f3fcdaca7e53fd1c13f3f3f3f3f3f3f3fd9e93f3f3f3f3fddbb
UTF-8 鸚쒖눦揖у선兪낆댉凉깅끆利몌Ⅴ揄몄뒾凉깅냵飮 1110100110111000100110101110110010010010100101101110101110001000101001101110011010001111100101101101000110000011111011001000010010100000111001011000010110101010111010111000001010000110111010111000110010001001111011111010010110111001111010101011100110000101111010111000000110000110111011111010011110011101111010111010101010001100111000101000010110100100111001101000111110000100111010111010101010000100111010111001001010111110111011111010010110111001111010101011100110000101111010111000001110110101111010011010001110101110 e9b89aec9296eb88a6e68f96d183ec84a0e585aaeb8286eb8c89efa5b9eab985eb8186efa79debaa8ce285a4e68f84ebaa84eb92beefa5b9eab985eb83b5e9a3ae
UHC 鸚쒖눦揖у선兪낆댉凉깅끆利몌Ⅴ揄몄뒾凉깅냵飮 1110010110100100100111001110110010000111101111011110101111100111101011001110010110111100101100011110101011100100100001011110110010001000101100101110010110111100101100011110101110000101101110101110110010100110101110001110111110100101101101001110101011110001101110001110110010001010101101001110010110111100101100011110101110000110100001011110101111100110 e5a49cec87bdebe7ace5bcb1eae485ec88b2e5bcb1eb85baeca6b8efa5b4eaf1b8ec8ab4e5bcb1eb8685ebe6

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
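
The same kind of conversion can be sketched offline in Python. This is a minimal illustration, not the page's actual implementation: the codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumed to be the closest Python equivalents of the charsets listed in the table, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent show up as "?" (3f) in the table, which the sketch imitates with errors="replace".

```python
# Table labels mapped to Python codec names.
# These mappings are assumptions: the closest stdlib equivalents, not
# necessarily the exact conversion tables the page uses.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters are replaced by "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

For example, "あ" has no ISO-8859-1 representation and comes out as 3f there, while UTF-8 gives e38182 and CP932 gives 82a0.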