To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??毅?ぜ矣??猥??誼?┯喩??沃 11100001100111110011111100111111100010110100001000111111100000101011101011100001111000010011111100111111111000001100111000111111001111111000101101100010001111111000010010110110100110100110011100111111001111111001011110000000 e19f3f3f8b423f82bae1e13f3fe0ce3f3f8b623f84b69a673f3f9780
EUC-JP 癲??毅?ぜ矣??猥??誼?┯喩??沃 11100010101000010011111100111111101101011010001100111111101001001011110011100010111000110011111100111111111000001101000000111111001111111011010111000011001111111010100010111000110100111100100000111111001111111100110111100000 e2a13f3fb5a33fa4bce2e33f3fe0d03f3fb5c33fa8b8d3c83f3fcde0
UTF-8 癲몃돆毅볢ぜ矣쒖땡猥됰씮誼띰┯喩쀬삌沃 111001111001100110110010111010111010101010000011111010111000111110000110111001101010111110000101111010111011001110100010111000111000000110011100111001111001111110100011111011001001001010010110111010111001010110100001111001111000110010100101111010111001000010110000111011001001010010101110111010001010101010111100111010111001110110110000111000101001010010101111111001011001011010101001111011001000000010101100111011001000001010001100111001101011001010000011 e799b2ebaa83eb8f86e6af85ebb3a2e3819ce79fa3ec9296eb95a1e78ca5eb90b0ec94aee8aabceb9db0e294afe596a9ec80acec828ce6b283
UHC 癲몃돆毅볢ぜ矣쒖땡猥됰씮誼띰┯喩쀬삌沃 1110111110100110101110001110101110001001100101111110101111110110100100111110100010101010101111001110101111111000100111001110110010110110101011111110100011100101100010011110101110011101101111111110101111111110101101101110111110100110101110001110101011100111100101111110110010011000100100111110100010101010 efa6b8eb8997ebf693e8aabcebf89cecb6afe8e589eb9dbfebfeb6efa6b8eae797ec9893e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)