What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???楢??矣??語⑤????????ъ?爾 00111111001111110011111110010011111010000011111100111111111000011110000100111111001111111000110011101010100001110100010000111111001111110011111100111111001111110011111100111111001111111000010010001100001111111000111010100010 3f3f3f93e83f3fe1e13f3f8cea87443f3f3f3f3f3f3f3f848c3f8ea2
EUC-JP ???楢??矣??語??嫄??????ъ?爾 0011111100111111001111111100011011101010001111110011111111100010111000110011111100111111101110001110110000111111001111111000111110111010101000010011111100111111001111110011111100111111001111111010011111101100001111111011110010100100 3f3f3fc6ea3f3fe2e33f3fb8ec3f3f8fbaa13f3f3f3f3f3fa7ec3fbca4
UTF-8 嶺뚮슢楢쇗윓矣몄쵄語⑤베嫄숅쐯栒뀀섰嶪ъ뜾爾 1110111110100110101010111110101110011010101011101110110010001010101000101110011010100101101000101110110010000111100101111110110010011100100100111110011110011111101000111110101110101010100001001110110010110101100001001110100010101010100111101110001010010001101001001110101110110010101000001110010110101011100001001110110010001000100001011110110010010000101011111110011010100000100100101110101110000000100000001110110010000100101100001110010110110110101010101101000110001010111010111001110010111110111001111000100010111110 efa6abeb9aaeec8aa2e6a5a2ec8797ec9c93e79fa3ebaa84ecb584e8aa9ee291a4ebb2a0e5ab84ec8885ec90afe6a092eb8080ec84b0e5b6aad18aeb9cbee788be
UHC 嶺뚮슢楢쇗윓矣몄쵄語⑤베嫄숅쐯栒뀀섰嶪ъ뜾爾 1110011110101101100011001110101110011010101011101110101011111001101111001110011010011111100110101110101111111000101110001110110010101100100001101110010111011110101010001110101110111010101000111110101010110001100110011110100110011100100100111110001011100011101100101110101110111100101110011110010111110101101011001110110010001101101110011110110010110011 e7ad8ceb9aaeeaf9bce69f9aebf8b8ecac86e5dea8ebbaa3eab199e99c93e2e3b2ebbcb9e5f5acec8db9ecb3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
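
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the function name encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Charset labels follow the table above; the Python codec names on the
# right are assumed equivalents, not taken from the original tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary bit string, hexadecimal string)}."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for any unencodable character
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("語").items():
        print(label, binary, hexadecimal)
```

For the single character 語, this sketch yields 8cea under cp932, b8ec under EUC-JP, and e8aa9e under UTF-8, matching the byte sequences shown for that character in the corresponding table rows, while ISO-8859-1 falls back to 3f.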