What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??毅?ぜ矣??繹??誼??諛??沃 111000011001111100111111001111111000101101000010001111111000001010111010111000011110000100111111001111111110001110001000001111110011111110001011011000100011111100111111111001101000011100111111001111111001011110000000 e19f3f3f8b423f82bae1e13f3fe3883f3f8b623f3fe6873f3f9780
EUC-JP 癲??毅?ぜ矣??繹??誼??諛??沃 111000101010000100111111001111111011010110100011001111111010010010111100111000101110001100111111001111111110010111101000001111110011111110110101110000110011111100111111111010111110011100111111001111111100110111100000 e2a13f3fb5a33fa4bce2e33f3fe5e83f3fb5c33f3febe73f3fcde0
UTF-8 癲몃돆毅볢ぜ矣몄춪繹먮굞誼당븨諛몃쾳沃 111001111001100110110010111010111010101010000011111010111000111110000110111001101010111110000101111010111011001110100010111000111000000110011100111001111001111110100011111010111010101010000100111011001011011010101010111001111011100110111001111010111010100010101110111010101011010110011110111010001010101010111100111010111000101110111001111010111011100010101000111010001010101110011011111010111010101010000011111011001011111010110011111001101011001010000011 e799b2ebaa83eb8f86e6af85ebb3a2e3819ce79fa3ebaa84ecb6aae7b9b9eba8aeeab59ee8aabceb8bb9ebb8a8e8ab9bebaa83ecbeb3e6b283
UHC 癲몃돆毅볢ぜ矣몄춪繹먮굞誼당븨諛몃쾳沃 1110111110100110101110001110101110001001100101111110101111110110100100111110100010101010101111001110101111111000101110001110110010101101100001111110011010111010100100001110101110000010100001101110101111111110101101001110011110010101100100011110101110110000101110001110101110110010100010011110100010101010 efa6b8eb8997ebf693e8aabcebf8b8ecad87e6ba90eb8286ebfeb4e79591ebb0b8ebb289e8aa

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
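
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and so is the choice to replace unencodable characters with "?" (0x3f), which is what the 3f bytes in the table suggest. The function name encode_table is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent (an assumption about the tool's behavior).
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.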