What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??孩?????必燎藺咐げ??脹???B 001111110011111110011011011101110011111100111111001111110011111100111111100101010100101111100000100110011110010101100001100110011111001110000010101100000011111100111111100100101010111100111111001111110011111101000010 3f3f9b773f3f3f3f3f954be099e56199f382b03f3f92af3f3f3f42
EUC-JP ?堞孩?????必燎藺咐げ??脹???B 0011111110001111101110001010010011010101110110000011111100111111001111110011111100111111110010011010110011011111111110011110100111000010110100101111010110100100101100100011111100111111110001001011000100111111001111110011111101000010 3f8fb8a4d5d83f3f3f3f3fc9acdff9e9c2d2f5a4b23f3fc4b13f3f3f42
UTF-8 뤋堞孩쾸쵍샅렟뤋必燎藺咐げ렗뤋脹컦샘그B 11101011101001001000101111100101101000001001111011100101101011011010100111101100101111101011100011101100101101011000110111101100100000111000010111101011101000001001111111101011101001001000101111100101101111111000010111100111100001111000111011101000100101111011101011100101100100101001000011100011100000011001001011101011101000001001011111101011101001001000101111101000100001001011100111101100101110111010011011101100100000111001100011101010101101111011100001000010 eba48be5a09ee5ada9ecbeb8ecb58dec8385eba09feba48be5bf85e7878ee897bae59290e38192eba097eba48be884b9ecbba6ec8398eab7b842
UHC 뤋堞孩쾸쵍샅렟뤋必燎藺咐げ렗뤋脹컦샘그B 100011111011101111110100110111001111101010101001101100101000111010101100100011111011101111110100100011101011000010001111101110111111100110110001110101101111101011010111111101001101110011111011101010101011001010001110101011001000111110111011111100111110110010110000100011111011101111111001101100011101011101000010 8fbbf4dcfaa9b28eac8fbbf48eb08fbbf9b1d6fad7f4dcfbaab28eac8fbbf3ecb08fbbf9b1d742

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
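
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that the Python codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that characters a charset cannot represent are replaced with `?` (0x3f), as the table appears to do. The names `CHARSETS`, `encode_row`, and `encode_all` are hypothetical helpers introduced for illustration.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_all(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hex string.
    for name, (binary, hexadecimal) in encode_all("げB").items():
        print(name, binary, hexadecimal)
```

For example, under these assumptions the character げ encodes to 82b0 in SJIS-WIN, a4b2 in EUC-JP, and e38192 in UTF-8, which matches the bytes for that character in the table, while ISO-8859-1 cannot represent it and yields 3f.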