Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 畑??悠?ぜ宋??嚥△?溢←?怨??藥 1001010010101000001111110011111110010111010010010011111110000010101110101001000101110110001111110011111110011010100010111000000110100010001111111000100011101100100000011010100100111111100010011000010100111111001111111110010101011010 94a83f3f97493f82ba91763f3f9a8b81a23f88ec81a93f89853f3fe55a
EUC-JP 畑??悠?ぜ宋??嚥△?溢←?怨??藥 1100100010101010001111110011111111001101101010100011111110100100101111001100000111010111001111110011111111010011111010111010001010100100001111111011000011101110101000101010101100111111101100011110010100111111001111111110100110111011 c8aa3f3fcdaa3fa4bcc1d73f3fd3eba2a43fb0eea2ab3fb1e53f3fe9bb
UTF-8 畑밴퉭悠껇ぜ宋볧꼤嚥△뫁溢←뵓怨뺣쎘藥 111001111001010110010001111010111011000010110100111011011000100110101101111001101000001010100000111010101011101110000111111000111000000110011100111001011010111010001011111010111011001110100111111010101011110010100100111001011001101010100101111000101001011010110011111010111010101110000001111001101011101010100010111000101000011010010000111010111011010110010011111001101000000010101000111010111011101010100011111011001000111010011000111010001001011110100101 e79591ebb0b4ed89ade682a0eabb87e3819ce5ae8bebb3a7eabca4e59aa5e296b3ebab81e6baa2e28690ebb593e680a8ebbaa3ec8e98e897a5
UHC 畑밴퉭悠껇ぜ宋볧꼤嚥△뫁溢←뵓怨뺣쎘藥 1110111110100101101110011110101010111001100001011110101011101101100000111110100010101010101111001110000111100100100100111110110110000100100000011110011010111111101000011110001010010001101001011110110011101110101000011110011110010100100101011110101010110011100101011110101110011011101111111110010110110111 efa5b9eab985eaed83e8aabce1e493ed8481e6bfa1e291a5eceea1e79495eab395eb9bbfe5b7

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
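
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is one plausible reason for the runs of 3f in the table.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary bit string, hex bit string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("畑"):
        print(label, bits, hex_string)
```

For the single character 畑, this yields 3f for ISO-8859-1, 94a8 for SJIS-WIN, c8aa for EUC-JP, and e79591 for UTF-8, which agrees with the leading bytes of the corresponding rows in the table.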