To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 藥???у?熬??鵝?????言??汝?? 111001010101101000111111001111110011111110000100100001010011111111100000100100100011111100111111111010100100000000111111001111110011111100111111001111111000110010111110001111110011111110010011111100000011111100111111 e55a3f3f3f84853fe0923f3fea403f3f3f3f3f8cbe3f3f93f03f3f
EUC-JP 藥???у?熬??鵝??孼??言??汝?? 1110100110111011001111110011111100111111101001111110010100111111110111111111001000111111001111111111001110100001001111110011111110001111101110101100001100111111001111111011100011000000001111110011111111000110111100100011111100111111 e9bb3f3f3fa7e53fdff23f3ff3a13f3f8fbac33f3fb8c03f3fc6f23f3f
UTF-8 藥썸릍歷у렘熬뽪뿈鵝녵퐜孼껃렘言됪겘汝싩떻 1110100010010111101001011110110010001101101110001110101110100110100011011110111110100110100011001101000110000011111010111010000010011000111001111000011010101100111010111011110110101010111010111011111110001000111010011011010110011101111010111000010110110101111011011001000010011100111001011010110110111100111010101011101110000011111010111010000010011000111010001010100010000000111010111001000010101010111010101011001010011000111001101011000110011101111011001000101110101001111010111001011010111011 e897a5ec8db8eba68defa68cd183eba098e786acebbdaaebbf88e9b59deb85b5ed909ce5adbceabb83eba098e8a880eb90aaeab298e6b19dec8ba9eb96bb
UHC 藥썸릍歷у렘熬뽪뿈鵝녵퐜孼껃렘言됪겘汝싩떻 111001011011011110111101111001101011100010101100111001101011100010101100111001011011011110111101111010001010001010010110111001101001011110001111111001001011110110000110111001001011110110000110111001011110110110000011111001011011011110111101111001011110101110001001111001101000000110101111111001101010001110011010111001111011011010111011 e5b7bde6b8ace6b8ace5b7bde8a296e6978fe4bd86e4bd86e5ed83e5b7bde5eb89e681afe6a39ae7b6bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)