What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???泣??魏??B 001111110011111100111111100010111000001100111111001111111110100110110000001111110011111101000010 3f3f3f8b833f3fe9b03f3f42
EUC-JP ???泣??魏??B 001111110011111100111111101101011110001100111111001111111111001010110010001111110011111101000010 3f3f3fb5e33f3ff2b23f3f42
UTF-8 捻뀀슢泣뚪춳魏놁냱B 11101111101001101010010011101011100000001000000011101100100010101010001011100110101100111010001111101011100110101010101011101100101101101011001111101001101011011000111111101011100001101000000111101011100000111011000101000010 efa6a4eb8080ec8aa2e6b3a3eb9aaaecb6b3e9ad8feb8681eb83b142
UHC 捻뀀슢泣뚪춳魏놁냱B 11100110111101111011001011101011100110101010111011101011111010001000110011101001101011011000111111101010111000001000011011101100100001101000000101000010 e6f7b2eb9aaeebe88ce9ad8feae086ec868142

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
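
The kind of conversion shown in the table above can be sketched in Python. This is only an illustration, not the converter's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the same substitution the table shows.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # Windows Shift_JIS variant
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexa) in encode_table("泣B").items():
        print(name, binary, hexa)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.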