To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??萸??韋??語??油ょ?寃??亦 100110001101101000111111001111111110010011001110001111110011111111101000111010000011111100111111100011001110101000111111001111111001011011111011100000101110010100111111100110111000001100111111001111111001011010010010 98da3f3fe4ce3f3fe8e83f3f8cea3f3f96fb82e53f9b833f3f9692
EUC-JP 俑??萸??韋??語??油ょ?寃??亦 110100001101110000111111001111111110100011010000001111110011111111110000111010100011111100111111101110001110110000111111001111111100110011111101101001001110011100111111110101011110001100111111001111111100101111110010 d0dc3f3fe8d03f3ff0ea3f3fb8ec3f3fccfda4e73fd5e33f3fcbf2
UTF-8 俑앹늿萸쇘땟韋얜겱語ⓦ꺃油ょ깗寃몃쳴亦 111001001011111110010001111011001001010110111001111010111000101010111111111010001001000010111000111011001000011110011000111010111001010110011111111010011001111110001011111011001001011010011100111010101011001010110001111010001010101010011110111000101001001110100110111010101011101010000011111001101011001010111001111000111000001010000111111010101011100110010111111001011010111110000011111010111010101010000011111011001011001110110100111001001011101010100110 e4bf91ec95b9eb8abfe890b8ec8798eb959fe99f8bec969ceab2b1e8aa9ee293a6eaba83e6b2b9e38287eab997e5af83ebaa83ecb3b4e4baa6
UHC 俑앹늿萸쇘땟韋얜겱語ⓦ꺃油ょ깗寃몃쳴亦 1110100110110101100111011110110010001000100010001110101110101101101111001110011110110110101011011110101011011111101111101110101110000001101111011110010111011110101010001110001110000011101011001110101011111010101010101110011110000011100011111110101010110010101110001110101110101011100101111110011010110010 e9b59dec8888ebadbce7b6adeadfbeeb81bde5dea8e383aceafaaae7838feab2b8ebab97e6b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)