What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?泣??隱??矣?????釗ζ?怨?? 1110010011101000100000101110101000111111100010111000001100111111001111111110100010101010001111110011111111100001111000010011111100111111001111110011111100111111111110111011101110000011110001000011111110001001100001010011111100111111 e4e882ea3f8b833f3fe8aa3f3fe1e13f3f3f3f3ffbbb83c43f89853f3f
EUC-JP 蒻れ?泣??隱??矣??孼??釗ζ?怨?? 1110100011101010101001001110110000111111101101011110001100111111001111111111000010101100001111110011111111100010111000110011111100111111100011111011101011000011001111110011111110001111111000111010011010100110110001100011111110110001111001010011111100111111 e8eaa4ec3fb5e33f3ff0ac3f3fe2e33f3f8fbac33f3f8fe3a6a6c63fb1e53f3f
UTF-8 蒻れ슦泣길룚隱잌땔矣곗뒻孼뽰꼻釗ζ윍怨쀫뮓 1110100010010010101110111110001110000010100011001110110010001010101001101110011010110011101000111110101010111000101110001110101110100011100110101110100110011010101100011110110010011110100011001110101110010101100101001110011110011111101000111110101010110011100101111110101110010010101110111110010110101101101111001110101110111101101100001110101010111100101110111110100110000111100101111100111010110110111011001001110010001101111001101000000010101000111011001000000010101011111010111010111010010011 e892bbe3828cec8aa6e6b3a3eab8b8eba39ae99ab1ec9e8ceb9594e79fa3eab397eb92bbe5adbcebbdb0eabcbbe98797ceb6ec9c8de680a8ec80abebae93
UHC 蒻れ슦泣길룚隱잌땔矣곗뒻孼뽰꼻釗ζ윍怨쀫뮓 111001011011011010101010111011001001101010110000111010111110100010110001111001101000111110010110111010111101111110011111111001011011011010101010111010111111100010110000111011001000101010110001111001011110110110010110111011001000010010010011111000011111001010100101111001101001111110010100111010101011001110010111111010111001001010011111 e5b6aaec9ab0ebe8b1e68f96ebdf9fe5b6aaebf8b0ec8ab1e5ed96ec8493e1f2a5e69f94eab397eb929f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
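
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation, which is not shown here. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are Python's, and mapping the table's labels to them is an assumption. The sketch also assumes that characters a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the table. The helper names `encode_row` and `convert` are hypothetical.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a sample hiragana character.
    for label, (bits, hexa) in convert("あ").items():
        print(label, bits, hexa)
```

For example, `convert("あ")["UTF-8"]` yields the hex string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no hiragana.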