To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN セ齒ツァ実錂斜 10111110111010101000111111000010111100001000111010100111100011101100000011111011110111101000111011001110 beea8fc2f08ea78ec0fbde8ece
EUC-JP セ齒ツ?ァ実錂斜 10001110101111101111001111101111100011101100001000111111100011101010011110111100110000101000111111100100110101001011110011010000 8ebef3ef8ec23f8ea7bcc28fe4d4bcd0
UTF-8 セ齒ツァ実錂斜 111011111011110110111110111010011011110110010010111011111011111010000010111011101000000110001101111011111011110110100111111001011010111010011111111010011000110010000010111001101001011010011100 efbdbee9bd92efbe82ee818defbda7e5ae9fe98c82e6969c
UHC ?齒?????斜 00111111111101101100110100111111001111110011111100111111001111111101111011011000 3ff6cd3f3f3f3f3fded8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)