What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???榮??鸚?????節?ケ淞ュ?? 00111111001111110011111110011110110001000011111100111111111010100101111100111111001111110011111100111111001111111001000011011111001111111000001101010000100111111100001010000011100001010011111100111111 3f3f3f9ec43f3fea5f3f3f3f3f3f90df3f83509fc283853f3f
EUC-JP 獒??榮??鸚??艅??節?ケ淞ュ?渶 10001111110010111011101100111111001111111101110011000110001111110011111111110011110000000011111100111111100011111101011011111101001111110011111111000000111000010011111110100101101100011101111011000100101001011110010100111111100011111100011111101101 8fcbbb3f3fdcc63f3ff3c03f3f8fd6fd3f3fc0e13fa5b1dec4a5e53f8fc7ed
UTF-8 獒꺿닖榮붺껍鸚뀐쉑艅꾬슘節김ケ淞ュ쨱渶 111001111000110110010010111010101011101010111111111010111000101110010110111001101010011010101110111010111011011010111010111010101011101110001101111010011011100010011010111010111000000010010000111011001000100110010001111010001000100110000101111010101011111010101100111011001000101010011000111001111010111110000000111010101011100110000000111000111000001010110001111001101011011110011110111000111000001110100101111011001010100010110001111001101011100010110110 e78d92eababfeb8b96e6a6aeebb6baeabb8de9b89aeb8090ec8991e88985eabeacec8a98e7af80eab980e382b1e6b79ee383a5eca8b1e6b8b6
UHC 獒꺿닖榮붺껍鸚뀐쉑艅꾬슘節김ケ淞ュ쨱渶 1110100010100011100000111110001010001000100110101110011110110100100101001110011110110010101011101110010110100100101100101110111110111101101001111110011010101001100001001110111110111101101101111110111110111101101100011110100010101011101100011110000111100111101010111110010110100100100010111110011110110111 e8a383e2889ae7b494e7b2aee5a4b2efbda7e6a984efbdb7efbdb1e8abb1e1e7abe5a48be7b7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
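
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_row` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (byte 0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "ケ"  # hypothetical input; any short string works
    for label, codec in CHARSETS.items():
        bits, hexstr = encode_row(sample, codec)
        print(label, bits, hexstr)
```

For the single character ケ, this sketch gives `e382b1` in UTF-8, `a5b1` in EUC-JP, and `8350` in SJIS-WIN, the same byte sequences that appear for that character in the table rows above.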