To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 悌???辛皎濾以?悌???辛皎濾淫?^ 1001001011101110001111110011111100111111100100000110100011100001101001111110000001101000100010001100100000111111100100101110111000111111001111110011111110010000011010001110000110100111111000000110100010001000111110100011111101011110 92ee3f3f3f9068e1a7e06888c83f92ee3f3f3f9068e1a7e06888fa3f5e
EUC-JP 悌???辛皎濾以?悌???辛皎濾淫?^ 1100010011110000001111110011111100111111101111111100100111100010101010011101111111001001101100001100101000111111110001001111000000111111001111110011111110111111110010011110001010101001110111111100100110110000111111000011111101011110 c4f03f3f3fbfc9e2a9dfc9b0ca3fc4f03f3f3fbfc9e2a9dfc9b0fc3f5e
UTF-8 悌ㅹ렦렑辛皎濾以렓悌ㅹ렦렑辛皎濾淫렒^ 11100110100000101000110011100011100001011011100111101011101000001010011011101011101000001001000111101000101111101001101111100111100110101000111011100110101111111011111011100100101110111010010111101011101000001001001111100110100000101000110011100011100001011011100111101011101000001010011011101011101000001001000111101000101111101001101111100111100110101000111011100110101111111011111011100110101101111010101111101011101000001001001001011110 e6828ce385b9eba0a6eba091e8be9be79a8ee6bfbee4bba5eba093e6828ce385b9eba0a6eba091e8be9be79a8ee6bfbee6b7abeba0925e
UHC 悌ㅹ렦렑辛皎濾以렓悌ㅹ렦렑辛皎濾淫렒^ 11110000101010101010010011101001100011101011010110001110101001101110001111110100110011101110101111010101111010111110110010100100100011101010100011110000101010101010010011101001100011101011010110001110101001101110001111110100110011101110101111010101111010111110101111100010100011101010011101011110 f0aaa4e98eb58ea6e3f4ceebd5ebeca48ea8f0aaa4e98eb58ea6e3f4ceebd5ebebe28ea75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)