To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???▽?蹂≪?如??諭??儀??億 111000011001111100111111001111110011111110000001101001000011111111100110111110001000000111100001001111111001010001000000001111110011111110010111010000000011111100111111100010110101011000111111001111111000100110101101 e19f3f3f3f81a43fe6f881e13f94403f3f97403f3f8b563f3f89ad
EUC-JP 癲??沅▽?蹂≪?如??諭??儀??億 1110001010100001001111110011111110001111110001101110100110100010101001100011111111101100111110101010001011100011001111111100011110100001001111110011111111001101101000010011111100111111101101011011011100111111001111111011001010101111 e2a13f3f8fc6e9a2a63fecfaa2e33fc7a13f3fcda13f3fb5b73f3fb2af
UTF-8 癲숆낯沅▽펺蹂≪낄如붵끋諭븝쬉儀볥궞億 111001111001100110110010111011001000100010000110111010111000001010101111111001101011001010000101111000101001011010111101111011011000111010111010111010001011100110000010111000101000100110101010111010111000001010000100111001011010011010000010111010111011011010110101111010111000000110001011111010001010101110101101111010111011100010011101111011001010110010001001111001011000010010000000111010111011001110100101111010101011011010011110111001011000010010000100 e799b2ec8886eb82afe6b285e296bded8ebae8b982e289aaeb8284e5a682ebb6b5eb818be8abadebb89decac89e58480ebb3a5eab69ee58484
UHC 癲숆낯沅▽펺蹂≪낄如붵끋諭븝쬉儀볥궞億 1110111110100110100110011110101010110011101110001110101010110110101000011110010010111100100010101110101110110011101000011110110010110011101001011110010111111101100101001110001110000101101111011110101110110001101110101110111110100110100111111110101111110000100100111110101110000010101100011110010111100010 efa699eab3b8eab6a1e4bc8aebb3a1ecb3a5e5fd94e385bdebb1baefa69febf093eb82b1e5e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)