To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 蝨磯、牌蝨磯、牌B 111001011001110010001000111010011010010010010100011101101110010110011100100010001110100110100100100101000111011001000010 e59c88e9a49476e59c88e9a4947642
EUC-JP 蝨磯、牌蝨磯、牌B 1110100111111100101100001110101110001110101001001100011111010111111010011111110010110000111010111000111010100100110001111101011101000010 e9fcb0eb8ea4c7d7e9fcb0eb8ea4c7d742
UTF-8 蝨磯、牌蝨磯、牌B 11101000100111011010100011100111101000111010111111101111101111011010010011100111100010011000110011101000100111011010100011100111101000111010111111101111101111011010010011100111100010011000110001000010 e89da8e7a3afefbda4e7898ce89da8e7a3afefbda4e7898c42
UHC 蝨磯?牌蝨磯?牌B 111000111010010011010001101101000011111111111000101010111110001110100100110100011011010000111111111110001010101101000010 e3a4d1b43ff8abe3a4d1b43ff8ab42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)