What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????i?????????iB 001111110011111100111111001111110011111100111111001111110011111100111111011010010011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f6942
SJIS-WIN 帳??橈?????i帳??橈?????iB 10010010101000000011111100111111100111101111010000111111001111110011111100111111001111110110100110010010101000000011111100111111100111101111010000111111001111110011111100111111001111110110100101000010 92a03f3f9ef43f3f3f3f3f6992a03f3f9ef43f3f3f3f3f6942
EUC-JP 帳??橈?????i帳??橈?????iB 11000100101000100011111100111111110111001111011000111111001111110011111100111111001111110110100111000100101000100011111100111111110111001111011000111111001111110011111100111111001111110110100101000010 c4a23f3fdcf63f3f3f3f3f69c4a23f3fdcf63f3f3f3f3f6942
UTF-8 帳쏉푴橈곁죪樂됵풛i帳쏉푴橈곁죪樂됵풛iB 111001011011100010110011111011001000111110001001111011011001000110110100111001101010100110001000111010101011001110000001111011001010001110101010111011111010011010111111111010111001000010110101111011011001001010011011011010011110010110111000101100111110110010001111100010011110110110010001101101001110011010101001100010001110101010110011100000011110110010100011101010101110111110100110101111111110101110010000101101011110110110010010100110110110100101000010 e5b8b3ec8f89ed91b4e6a988eab381eca3aaefa6bfeb90b5ed929b69e5b8b3ec8f89ed91b4e6a988eab381eca3aaefa6bfeb90b5ed929b6942
UHC 帳쏉푴橈곁죪樂됵풛i帳쏉푴橈곁죪樂됵풛iB 111011011110001110011011111011111011111010000010111010001111101010110000111001111010000110000101111010001111100110001001111011111011111010011110011010011110110111100011100110111110111110111110100000101110100011111010101100001110011110100001100001011110100011111001100010011110111110111110100111100110100101000010 ede39befbe82e8fab0e7a185e8f989efbe9e69ede39befbe82e8fab0e7a185e8f989efbe9e6942

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hex 3f) in the table above.
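
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: it assumes that Python's standard codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that unencodable characters should be replaced with "?" as the table suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical helpers introduced only for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# This is an illustration; the page's own mapping is not documented here.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("iB").items():
        print(label, bits, hex_string)
```

For plain ASCII input such as "iB", every charset in the sketch yields the same bytes (hex 6942), which matches the trailing `6942` in each row of the table. Differences only show up for non-ASCII characters, where byte-level results may also vary slightly between codec implementations.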