To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?ル???????濡?????悟?????^ 00111111100000111000101100111111001111110011111100111111001111110011111100111111100101000100011100111111001111110011111100111111001111111000110011100101001111110011111100111111001111110011111101011110 3f838b3f3f3f3f3f3f3f94473f3f3f3f3f8ce53f3f3f3f3f5e
EUC-JP 邕ル?饔?????濡?????悟?????^ 1000111111100001111011011010010111101011001111111000111111101000111011110011111100111111001111110011111100111111110001111010100000111111001111110011111100111111001111111011100011100111001111110011111100111111001111110011111101011110 8fe1eda5eb3f8fe8ef3f3f3f3f3fc7a83f3f3f3f3fb8e73f3f3f3f3f5e
UTF-8 邕ル졏饔낁삏溜삳졁濡싪뺀溜쇠퐛悟뽯졎溜뀀줁^ 11101001100000101001010111100011100000111010101111101100101000011000111111101001101001011001010011101011100000101000000111101100100000101000111111101111101001111000101111101100100000101011001111101100101000011000000111100110101111111010000111101100100010111010101011101011101110101000000011101111101001111000101111101100100001111010000011101101100100001001101111100110100000101001111111101011101111011010111111101100101000011000111011101111101001111000101111101011100000001000000011101100101001001000000101011110 e98295e383abeca18fe9a594eb8281ec828fefa78bec82b3eca181e6bfa1ec8baaebba80efa78bec87a0ed909be6829febbdafeca18eefa78beb8080eca4815e
UHC 邕ル졏饔낁삏溜삳졁濡싪뺀溜쇠퐛悟뽯졎溜뀀줁^ 11101000101110111010101111101011101000001011110011101000101111011000010111101000100110001001011011101010111111101011101111101011101000001011001011101011101000011001101011101000101110111010101111101010111111101011110011101000101111011000010111100111111101101001011011101011101000001011101111101010111111101011001011101011101000011001100001011110 e8bbabeba0bce8bd85e89896eafebbeba0b2eba19ae8bbabeafebce8bd85e7f696eba0bbeafeb2eba1985e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)