To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲??孺??源??筌??誼??惟??冗??^ 1110000110011111001111110011111110011011011111010011111100111111100011001011100100111111001111111110001010100011001111110011111110001011011000100011111100111111100010001101001000111111001111111000111111100111001111110011111101011110 e19f3f3f9b7d3f3f8cb93f3fe2a33f3f8b623f3f88d23f3f8fe73f3f5e
EUC-JP 癲?Ŋ孺??源??筌??誼??惟??冗??^ 11100010101000010011111110001111101010011010101111010101110111100011111100111111101110001011101100111111001111111110010010100101001111110011111110110101110000110011111100111111101100001101010000111111001111111011111011101001001111110011111101011110 e2a13f8fa9abd5de3f3fb8bb3f3fe4a53f3fb5c33f3fb0d43f3fbee93f3f5e
UTF-8 癲앸Ŋ孺삼쭓源딅즷筌뗪퀡誼뚪썚惟깅폀冗뱀걿^ 111001111001100110110010111011001001010110111000110001011000101011100101101011011011101011101100100000101011110011101100101011011001001111100110101110101001000011101011100101001000010111101100101001101011011111100111101011011000110011101011100101111010101011101101100000001010000111101000101010101011110011101011100110101010101011101100100011011001101011100110100000111001111111101010101110011000010111101101100011111000000011100101100001101001011111101011101100011000000011101010101100011011111101011110 e799b2ec95b8c58ae5adbaec82bcecad93e6ba90eb9485eca6b7e7ad8ceb97aaed80a1e8aabceb9aaaec8d9ae6839feab985ed8f80e58697ebb180eab1bf5e
UHC 癲앸Ŋ孺삼쭓源딅즷筌뗪퀡誼뚪썚惟깅폀冗뱀걿^ 11101111101001101001110111101011101010001010111111101010111010001011101111101111101001111000101111101010101110011000101011101011101000111000100111101111101001111000101111101010101100111001010111101011111111101000110011101001100110111000110111101010111011101011000111101011101111001000111111101001101101111011100111101100100000011010001001011110 efa69deba8afeae8bbefa78beab98aeba389efa78beab395ebfe8ce99b8deaeeb1ebbc8fe9b7b9ec81a25e
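The conversion shown in the table can be approximated offline with a minimal Python sketch. The mapping from the tool's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and `encode_report` is a hypothetical helper name, not part of the tool. Characters a charset cannot represent are replaced with `?` (byte 3f), which appears to match what the table shows for ISO-8859-1.

```python
# Assumed mapping from the tool's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated
        bits = "".join(format(byte, "08b") for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("あ"):
        print(label, bits, hex_string)
```

An ASCII character such as `^` encodes to the single byte 5e in all five charsets, which is why every row of the table ends in 5e.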

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).