To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 畑?????醫??i畑?????醫??iB 10010100101010000011111100111111001111110011111100111111111001111100111000111111001111110110100110010100101010000011111100111111001111110011111100111111111001111100111000111111001111110110100101000010 94a83f3f3f3f3fe7ce3f3f6994a83f3f3f3f3fe7ce3f3f6942
EUC-JP 畑?????醫??i畑?????醫??iB 11001000101010100011111100111111001111110011111100111111111011101101000000111111001111110110100111001000101010100011111100111111001111110011111100111111111011101101000000111111001111110110100101000010 c8aa3f3f3f3f3feed03f3f69c8aa3f3f3f3f3feed03f3f6942
UTF-8 畑댁옃栒롥쉽醫꾨퍠i畑댁옃栒롥쉽醫꾨퍠iB 111001111001010110010001111010111000110010000001111011001001100010000011111001101010000010010010111010111010000110100101111011001000100110111101111010011000011010101011111010101011111010101000111011011000110110100000011010011110011110010101100100011110101110001100100000011110110010011000100000111110011010100000100100101110101110100001101001011110110010001001101111011110100110000110101010111110101010111110101010001110110110001101101000000110100101000010 e79591eb8c81ec9883e6a092eba1a5ec89bde986abeabea8ed8da069e79591eb8c81ec9883e6a092eba1a5ec89bde986abeabea8ed8da06942
UHC 畑댁옃栒롥쉽醫꾨퍠i畑댁옃栒롥쉽醫꾨퍠iB 111011111010010110110100111011001001111010001111111000101110001110001110111001011011110110110001111011001010001010000100111010111011101110010111011010011110111110100101101101001110110010011110100011111110001011100011100011101110010110111101101100011110110010100010100001001110101110111011100101110110100101000010 efa5b4ec9e8fe2e38ee5bdb1eca284ebbb9769efa5b4ec9e8fe2e38ee5bdb1eca284ebbb976942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)