To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??悠??宥?????悠??揄??孃ゃ?? 1110000110011111001111110011111110010111010010010011111100111111100101110100011100111111001111110011111100111111001111111001011101001001001111110011111110011101100010010011111100111111100110110110111110000010111000010011111100111111 e19f3f3f97493f3f97473f3f3f3f3f97493f3f9d893f3f9b6f82e13f3f
EUC-JP 癲??悠??宥?????悠??揄??孃ゃ?? 1110001010100001001111110011111111001101101010100011111100111111110011011010100000111111001111110011111100111111001111111100110110101010001111110011111111011001111010010011111100111111110101011101000010100100111000110011111100111111 e2a13f3fcdaa3f3fcda83f3f3f3f3fcdaa3f3fd9e93f3fd5d0a4e33f3f
UTF-8 癲욌맧悠띷끽宥밸쎗廬믩똾悠썽죲揄쇰꼤孃ゃ끂劉 111001111001100110110010111011001001101010001100111010111010011110100111111001101000001010100000111010111001110110110111111010111000000110111101111001011010111010100101111010111011000010111000111011001000111010010111111011111010011010000010111010111010111110101001111010111001100010111110111001101000001010100000111011001000110110111101111011001010001110110010111001101000111110000100111011001000011110110000111010101011110010100100111001011010110110000011111000111000001010000011111010111000000110000010111011111010011110000111 e799b2ec9a8ceba7a7e682a0eb9db7eb81bde5aea5ebb0b8ec8e97efa682ebafa9eb98bee682a0ec8dbdeca3b2e68f84ec87b0eabca4e5ad83e38283eb8182efa787
UHC 癲욌맧悠띷끽宥밸쎗廬믩똾悠썽죲揄쇰꼤孃ゃ끂劉 1110111110100110100111101110101110010000101100001110101011101101100011011110011010110011101000111110101011101001101110011110101110011011101111101110010111111110100100101110101110001100100001001110101011101101101111011110100110100001100011011110101011110001101111001110101110000100100000011110010110111110101010101110001110000101101110001110101011100101 efa69eeb90b0eaed8de6b3a3eae9b9eb9bbee5fe92eb8c84eaedbde9a18deaf1bceb8481e5beaae385b8eae5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)