To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 歪??油??癒ロ?冗??循??蟻??藥 100110000110001100111111001111111001011011111011001111110011111110010110111111001000001110001101001111111000111111100111001111110011111110001111011110100011111100111111100010110110000100111111001111111110010101011010 98633f3f96fb3f3f96fc838d3f8fe73f3f8f7a3f3f8b613f3fe55a
EUC-JP 歪??油??癒ロ?冗??循??蟻??藥 110011111100010000111111001111111100110011111101001111110011111111001100111111101010010111101101001111111011111011101001001111110011111110111101110110110011111100111111101101011100001000111111001111111110100110111011 cfc43f3fccfd3f3fccfea5ed3fbee93f3fbddb3f3fb5c23f3fe9bb
UTF-8 歪뺤옕油얕맱癒ロ뜥冗뱀늿循깁굲蟻쏅짗藥 111001101010110110101010111010111011101010100100111011001001100010010101111001101011001010111001111011001001011010010101111010111010011110110001111001111001100110010010111000111000001110101101111010111001110010100101111001011000011010010111111010111011000110000000111010111000101010111111111001011011111010101010111010101011100110000001111010101011010110110010111010001001111110111011111011001000111110000101111011001010011110010111111010001001011110100101 e6adaaebbaa4ec9895e6b2b9ec9695eba7b1e79992e383adeb9ca5e58697ebb180eb8abfe5beaaeab981eab5b2e89fbbec8f85eca797e897a5
UHC 歪뺤옕油얕맱癒ロ뜥冗뱀늿循깁굲蟻쏅짗藥 1110100011100000100101011110110010011110100110111110101011111010101111101110100010010000101110001110101110101000101010111110110110001101101010001110100110110111101110011110110010001000100010001110001011100000101100011110100110000010100101011110101111111100100110111110101110100011100111101110010110110111 e8e095ec9e9beafabee890b8eba8abed8da8e9b7b9ec8888e2e0b1e98295ebfc9beba39ee5b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)