To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????×??????A 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111001111110011111100111111001111110011111101000001 3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f3f3f3f3f3f41
SJIS-WIN 蒻れ?泣??隱??猿????×肉?????A 1110010011101000100000101110101000111111100010111000001100111111001111111110100010101010001111110011111110001001100011100011111100111111001111110011111110000001011111101001001111110111001111110011111100111111001111110011111101000001 e4e882ea3f8b833f3fe8aa3f3f898e3f3f3f3f817e93f73f3f3f3f3f41
EUC-JP 蒻れ?泣??隱??猿??孼?×肉?????A 11101000111010101010010011101100001111111011010111100011001111110011111111110000101011000011111100111111101100011110111000111111001111111000111110111010110000110011111110100001110111111100011011111001001111110011111100111111001111110011111101000001 e8eaa4ec3fb5e33f3ff0ac3f3fb1ee3f3f8fbac33fa1dfc6f93f3f3f3f3f41
UTF-8 蒻れ슦泣길룚隱닷쩂猿딆뵛孼뽰×肉욜윢紐꾨룈A 111010001001001010111011111000111000001010001100111011001000101010100110111001101011001110100011111010101011100010111000111010111010001110011010111010011001101010110001111010111000101110110111111011001010100110000010111001111000110010111111111010111001010010000110111010111011010110011011111001011010110110111100111010111011110110110000110000111001011111101000100000101000100111101100100110101001110011101100100111001010001011101111101001111000111111101010101111101010100011101011101000111000100001000001 e892bbe3828cec8aa6e6b3a3eab8b8eba39ae99ab1eb8bb7eca982e78cbfeb9486ebb59be5adbcebbdb0c397e88289ec9a9cec9ca2efa78feabea8eba38841
UHC 蒻れ슦泣길룚隱닷쩂猿딆뵛孼뽰×肉욜윢紐꾨룈A 11100101101101101010101011101100100110101011000011101011111010001011000111100110100011111001011011101011110111111011010011100101101001001001110011101010101110111000101011101100100101001001101111100101111011011001011011101100101000011011111111101011101111111011111111100111100111111010001111101011101010101000010011101011100011111000011101000001 e5b6aaec9ab0ebe8b1e68f96ebdfb4e5a49ceabb8aec949be5ed96eca1bfebbfbfe79fa3ebaa84eb8f8741

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.
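
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper name `encode_row` are assumptions chosen for illustration, and unencodable characters are replaced with "?" (0x3f) to mirror the behavior visible in the table.

```python
# Hypothetical mapping from the charset labels used on this page
# to Python codec names (an assumption, not the page's own code).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    shown = raw.decode(codec, errors="replace")
    return shown, binary, raw.hex()


if __name__ == "__main__":
    sample = "れ×A"
    for label, codec in CHARSETS.items():
        shown, binary, hexed = encode_row(sample, codec)
        print(label, shown, binary, hexed)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.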