What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??意??醫?????悠よい怨??鵝 111010100101111100111111001111111000100011010011001111110011111111100111110011100011111100111111001111110011111100111111100101110100100110000010111001101000001010100010100010011000010100111111001111111110101001000000 ea5f3f3f88d33f3fe7ce3f3f3f3f3f974982e682a289853f3fea40
EUC-JP 鸚??意??醫?????悠よい怨??鵝 111100111100000000111111001111111011000011010101001111110011111111101110110100000011111100111111001111110011111100111111110011011010101010100100111010001010010010100100101100011110010100111111001111111111001110100001 f3c03f3fb0d53f3feed03f3f3f3f3fcdaaa4e8a4a4b1e53f3ff3a1
UTF-8 鸚쒓퍓意쎿룚醫덉첎廬믪슦悠よい怨룻떢鵝 111010011011100010011010111011001001001010010011111011011000110110010011111001101000010010001111111011001000111010111111111010111010001110011010111010011000011010101011111010111000110110001001111011001011001010001110111011111010011010000010111010111010111110101010111011001000101010100110111001101000001010100000111000111000001010001000111000111000000110000100111001101000000010101000111010111010001110111011111010111001011010100010111010011011010110011101 e9b89aec9293ed8d93e6848fec8ebfeba39ae986abeb8d89ecb28eefa682ebafaaec8aa6e682a0e38288e38184e680a8eba3bbeb96a2e9b59d
UHC 鸚쒓퍓意쎿룚醫덉첎廬믪슦悠よい怨룻떢鵝 1110010110100100100111001110101010111011100010101110101111110010100110111110011010001111100101101110110010100010100010001110110010101010100110111110010111111110100100101110110010011010101100001110101011101101101010101110100010101010101001001110101010110011101101111110110110001011101101101110010010111101 e5a49ceabb8aebf29be68f96eca288ecaa9be5fe92ec9ab0eaedaae8aaa4eab3b7ed8bb6e4bd

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
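
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that Python's codecs `cp932` and `cp949` are close enough stand-ins for SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row` and `encode_table` are made up for illustration.

```python
# Mapping from the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-WIN and cp949 ~ UHC; edge cases may differ.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("よい").items():
        print(name, bits, hex_string)
```

For the input "よい", this sketch yields e38288e38184 for UTF-8, 82e682a2 for SJIS-WIN and a4e8a4a4 for EUC-JP, which are the same byte sequences that appear inside the corresponding rows of the table above.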