To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??油??音??汚??游←?議??檍 100101001010100000111111001111111001011011111011001111110011111110001001101110010011111100111111100010011001100000111111001111111001111111100000100000011010100100111111100010110110001100111111001111111001111011111000 94a83f3f96fb3f3f89b93f3f89983f3f9fe081a93f8b633f3f9ef8
EUC-JP 畑??油??音??汚??游←?議??檍 110010001010101000111111001111111100110011111101001111110011111110110010101110110011111100111111101100011111100000111111001111111101111011100010101000101010101100111111101101011100010000111111001111111101110011111010 c8aa3f3fccfd3f3fb2bb3f3fb1f83f3fdee2a2ab3fb5c43f3fdcfa
UTF-8 畑밴퉭油뉐츦音썬룋汚삳슢游←춯議쇰걠檍 111001111001010110010001111010111011000010110100111011011000100110101101111001101011001010111001111010111000100110010000111011001011100010100110111010011001111110110011111011001000110110101100111010111010001110001011111001101011000110011010111011001000001010110011111011001000101010100010111001101011100010111000111000101000011010010000111011001011011010101111111010001010110110110000111011001000011110110000111010101011000110100000111001101010101010001101 e79591ebb0b4ed89ade6b2b9eb8990ecb8a6e99fb3ec8daceba38be6b19aec82b3ec8aa2e6b8b8e28690ecb6afe8adb0ec87b0eab1a0e6aa8d
UHC 畑밴퉭油뉐츦音썬룋汚삳슢游←춯議쇰걠檍 1110111110100101101110011110101010111001100001011110101011111010100001111110010110101110100111001110101111100101101111011110001110001111100010101110011111111101101110111110101110011010101011101110101011111101101000011110011110101101100011001110110010100001101111001110101110000001100010011110010111100101 efa5b9eab985eafa87e5ae9cebe5bde38f8ae7fdbbeb9aaeeafda1e7ad8ceca1bceb8189e5e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)