To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑????キ???筌??循ュ?循??魚 1001010010101000001111110011111100111111001111111000001101001100001111110011111100111111111000101010001100111111001111111000111101111010100000111000010100111111100011110111101000111111001111111000101110011011 94a83f3f3f3f834c3f3f3fe2a33f3f8f7a83853f8f7a3f3f8b9b
EUC-JP 畑????キ洧??筌??循ュ?循??魚 11001000101010100011111100111111001111110011111110100101101011011000111111000111101101000011111100111111111001001010010100111111001111111011110111011011101001011110010100111111101111011101101100111111001111111011010111111011 c8aa3f3f3f3fa5ad8fc7b43f3fe4a53f3fbddba5e53fbddb3f3fb5fb
UTF-8 畑띕끂留㎬キ洧꾨븶筌먦룂循ュ춢循뗫쐡魚 111001111001010110010001111010111001110110010101111010111000000110000010111011111010011110001101111000111000111010101100111000111000001010101101111001101011010010100111111010101011111010101000111010111011100010110110111001111010110110001100111010111010100010100110111010111010001110000010111001011011111010101010111000111000001110100101111011001011011010100010111001011011111010101010111010111001011110101011111011001001000010100001111010011010110110011010 e79591eb9d95eb8182efa78de38eace382ade6b4a7eabea8ebb8b6e7ad8ceba8a6eba382e5beaae383a5ecb6a2e5beaaeb97abec90a1e9ad9a
UHC 畑띕끂留㎬キ洧꾨븶筌먦룂循ュ춢循뗫쐡魚 1110111110100101101101101110101110000101101110001110101110100111101001111110100010101011101011011110101011111011100001001110101110010101100111111110111110100111100100001110001110001111100000111110001011100000101010111110010110101101100000111110001011100000100010111110101110011100100001111110010111100000 efa5b6eb85b8eba7a7e8abadeafb84eb959fefa790e38f83e2e0abe5ad83e2e08beb9c87e5e0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
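
A conversion like the one in the table can be sketched in Python as follows. This is a minimal illustration, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the function names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which would explain the runs of `3f` in the rows above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters that cannot be encoded are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, with no separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("畑").items():
        print(name, binary, hexadecimal)
```

For the single character 畑, this sketch yields `94a8` for SJIS-WIN, `c8aa` for EUC-JP, and `e79591` for UTF-8, which agrees with the leading bytes of the corresponding rows. Python's codecs may still differ from this page's converter for some characters.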