What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 臆??猶?????D臆??猶?????D^ 10001001101100000011111100111111100101110101000000111111001111110011111100111111001111110100010010001001101100000011111100111111100101110101000000111111001111110011111100111111001111110100010001011110 89b03f3f97503f3f3f3f3f4489b03f3f97503f3f3f3f3f445e
EUC-JP 臆??猶?????D臆??猶?????D^ 10110010101100100011111100111111110011011011000100111111001111110011111100111111001111110100010010110010101100100011111100111111110011011011000100111111001111110011111100111111001111110100010001011110 b2b23f3fcdb13f3f3f3f3f44b2b23f3fcdb13f3f3f3f3f445e
UTF-8 臆묐뜇猶뉑돹流곷쥏D臆묐뜇猶뉑돹流곷쥏D^ 111010001000011110000110111010111010110010010000111010111001110010000111111001111000110010110110111010111000100110010001111010111000111110111001111011111010011110001010111010101011001110110111111011001010010110001111010001001110100010000111100001101110101110101100100100001110101110011100100001111110011110001100101101101110101110001001100100011110101110001111101110011110111110100111100010101110101010110011101101111110110010100101100011110100010001011110 e88786ebac90eb9c87e78cb6eb8991eb8fb9efa78aeab3b7eca58f44e88786ebac90eb9c87e78cb6eb8991eb8fb9efa78aeab3b7eca58f445e
UHC 臆묐뜇猶뉑돹流곷쥏D臆묐뜇猶뉑돹流곷쥏D^ 111001011110011010010001111010111000110110001010111010111010001010000111111001101000100110111100111010101111110010000001111010111010001010001000010001001110010111100110100100011110101110001101100010101110101110100010100001111110011010001001101111001110101011111100100000011110101110100010100010000100010001011110 e5e691eb8d8aeba287e689bceafc81eba28844e5e691eb8d8aeba287e689bceafc81eba288445e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
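
A conversion of this kind can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the mapping from the charset labels above to Python codec names (for example SJIS-win to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f in the table above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in table("臆D^").items():
        print(name, bits, hexa)
```

Individual codecs may differ between implementations for some characters, so the output of this sketch is not guaranteed to match the table byte for byte.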