To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??肉??宥??域??竊?┤幽??鈺?? 1110000110011111001111110011111110010011111101110011111100111111100101110100011100111111001111111000100011100110001111110011111111100010100001100011111110000100101001111001011101001000001111110011111111111011110001000011111100111111 e19f3f3f93f73f3f97473f3f88e63f3fe2863f84a797483f3ffbc43f3f
EUC-JP 癲??肉??宥??域??竊?┤幽??鈺?? 111000101010000100111111001111111100011011111001001111110011111111001101101010000011111100111111101100001110100000111111001111111110001111100110001111111010100010101001110011011010100100111111001111111000111111100011110101010011111100111111 e2a13f3fc6f93f3fcda83f3fb0e83f3fe3e63fa8a9cda93f3f8fe3d53f3f
UTF-8 癲⑸뜄肉⒳틠宥몄벑域㏓슣竊먲┤幽덀뀏鈺곌퓖 111001111001100110110010111000101001000110111000111010111001110010000100111010001000001010001001111000101001001010110011111011011000101110100000111001011010111010100101111010111010101010000100111010111011001010010001111001011001111110011111111000111000111110010011111011001000101010100011111001111010101110001010111010111010100010110010111000101001010010100100111001011011100110111101111010111000110110000000111010111000000010001111111010011000100010111010111010101011001110001100111011011001001110010110 e799b2e291b8eb9c84e88289e292b3ed8ba0e5aea5ebaa84ebb291e59f9fe38f93ec8aa3e7ab8aeba8b2e294a4e5b9bdeb8d80eb808fe988baeab38ced9396
UHC 癲⑸뜄肉⒳틠宥몄벑域㏓슣竊먲┤幽덀뀏鈺곌퓖 111011111010011010101001111010111000110110001000111010111011111110101001111001001011101010001100111010101110100110111000111011001001001110110001111001101011010010100111111010111001101010101111111011111011110010010000111011111010011010101001111010101110101110001000111000111000010110001010111010001010110110110000111010101011111110000001 efa6a9eb8d88ebbfa9e4ba8ceae9b8ec93b1e6b4a7eb9aafefbc90efa6a9eaeb88e3858ae8adb0eabf81

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
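
The conversion shown in the table above can be approximated with a short Python sketch. This is not the code behind this page. It assumes that Python's `cp932` codec is an acceptable stand-in for SJIS-Win and `cp949` for UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the 3f bytes in the rows above. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, introduced only for illustration.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-Win and cp949 ~ UHC; minor mapping
# differences from the page's own converter are possible.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes '?' for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

Running it on a hiragana character shows the same pattern as the table: the Japanese charsets and UTF-8 each produce their own byte sequence, while ISO-8859-1 falls back to a single 3f byte because it has no code for the character.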