What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???意??循?????徇??貫踰??宥?? 00111111001111110011111110001000110100110011111100111111100011110111101000111111001111110011111100111111001111111001110001101101001111110011111110001010110100011110011011111010001111110011111110010111010001110011111100111111 3f3f3f88d33f3f8f7a3f3f3f3f3f9c6d3f3f8ad1e6fa3f3f97473f3f
EUC-JP ???意??循????Ŧ徇??貫踰??宥?? 001111110011111100111111101100001101010100111111001111111011110111011011001111110011111100111111001111111000111110101001101011111101011111001110001111110011111110110100110100111110110011111100001111110011111111001101101010000011111100111111 3f3f3fb0d53f3fbddb3f3f3f3f8fa9afd7ce3f3fb4d3ecfc3f3fcda83f3f
UTF-8 嶺뚮슢意덂츦循뉖쇀若뗫Ŧ徇띸춯貫踰ㅸ굢宥몃왂 1110111110100110101010111110101110011010101011101110110010001010101000101110011010000100100011111110101110001101100000101110110010111000101001101110010110111110101010101110101110001001100101101110110010000111100000001110111110100101101101001110101110010111101010111100010110100110111001011011111010000111111010111001110110111000111011001011011010101111111010001011001010101011111010001011100010110000111000111000010110111000111010101011010110100010111001011010111010100101111010111010101010000011111011001001100110000010 efa6abeb9aaeec8aa2e6848feb8d82ecb8a6e5beaaeb8996ec8780efa5b4eb97abc5a6e5be87eb9db8ecb6afe8b2abe8b8b0e385b8eab5a2e5aea5ebaa83ec9982
UHC 嶺뚮슢意덂츦循뉖쇀若뗫Ŧ徇띸춯貫踰ㅸ굢宥몃왂 1110011110101101100011001110101110011010101011101110101111110010100010001110010110101110100111001110001011100000100001111110101110011001101101001110010110101110100010111110101110101000101011101110001011011111100011011110011110101101100011001100111010111011111010111011001010100100111010001000001010001001111010101110100110111000111010111001111010110101 e7ad8ceb9aaeebf288e5ae9ce2e087eb99b4e5ae8beba8aee2df8de7ad8ccebbebb2a4e88289eae9b8eb9eb5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
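
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name encode_in_charsets and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions, and Python's codecs may differ from this page's converter for some characters. Unencodable characters are replaced with "?", which is how the 3f bytes in the table arise.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_in_charsets("意").items():
        print(label, binary, hexadecimal)
```

For the single character 意, this sketch gives 88d3 for SJIS-WIN, b0d5 for EUC-JP, e6848f for UTF-8, and 3f for ISO-8859-1, which agrees with the corresponding bytes in the table above.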