To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?泣??隱??矣?????幼?ⅰ儒?? 1110010011101000100000101110101000111111100010111000001100111111001111111110100010101010001111110011111111100001111000010011111100111111001111110011111100111111100101110110001100111111111110100100000010001110111100100011111100111111 e4e882ea3f8b833f3fe8aa3f3fe1e13f3f3f3f3f97633ffa408ef23f3f
EUC-JP 蒻れ?泣??隱??矣??孼??幼??儒?? 111010001110101010100100111011000011111110110101111000110011111100111111111100001010110000111111001111111110001011100011001111110011111110001111101110101100001100111111001111111100110111000100001111110011111110111100111101000011111100111111 e8eaa4ec3fb5e33f3ff0ac3f3fe2e33f3f8fbac33f3fcdc43f3fbcf43f3f
UTF-8 蒻れ슦泣길룚隱욃땔矣곗뒻孼뽰꼳幼뽪ⅰ儒밸렰 111010001001001010111011111000111000001010001100111011001000101010100110111001101011001110100011111010101011100010111000111010111010001110011010111010011001101010110001111011001001101010000011111010111001010110010100111001111001111110100011111010101011001110010111111010111001001010111011111001011010110110111100111010111011110110110000111010101011110010110011111001011011100110111100111010111011110110101010111000101000010110110000111001011000010010010010111010111011000010111000111010111010000010110000 e892bbe3828cec8aa6e6b3a3eab8b8eba39ae99ab1ec9a83eb9594e79fa3eab397eb92bbe5adbcebbdb0eabcb3e5b9bcebbdaae285b0e58492ebb0b8eba0b0
UHC 蒻れ슦泣길룚隱욃땔矣곗뒻孼뽰꼳幼뽪ⅰ儒밸렰 111001011011011010101010111011001001101010110000111010111110100010110001111001101000111110010110111010111101111110011110111001011011011010101010111010111111100010110000111011001000101010110001111001011110110110010110111011001000010010001100111010101110101010010110111001101010010110100001111010101110001110111001111010111000111010111101 e5b6aaec9ab0ebe8b1e68f96ebdf9ee5b6aaebf8b0ec8ab1e5ed96ec848ceaea96e6a5a1eae3b9eb8ebd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)