To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????ぁ悠お?日お??お??お◇ 001111110011111100111111001111110011111100111111100000101001111110010111010010011000001010101000001111111001001111111010100000101010100000111111001111111000001010101000001111110011111110000010101010001000000110011110 3f3f3f3f3f3f829f974982a83f93fa82a83f3f82a83f3f82a8819e
EUC-JP ??????ぁ悠お?日お??お??お◇ 001111110011111100111111001111110011111100111111101001001010000111001101101010101010010010101010001111111100011011111100101001001010101000111111001111111010010010101010001111110011111110100100101010101010000111111110 3f3f3f3f3f3fa4a1cdaaa4aa3fc6fca4aa3f3fa4aa3f3fa4aaa1fe
UTF-8 룵첂◈룵₃룵ぁ悠お룫日お룫혧お룫혧お◇ 111010111010001110110101111011001011001010000010111000101001011110001000111010111010001110110101111000101000001010000011111010111010001110110101111000111000000110000001111001101000001010100000111000111000000110001010111010111010001110101011111001101001011110100101111000111000000110001010111010111010001110101011111011011001100010100111111000111000000110001010111010111010001110101011111011011001100010100111111000111000000110001010111000101001011110000111 eba3b5ecb282e29788eba3b5e28283eba3b5e38181e682a0e3818aeba3abe697a5e3818aeba3abed98a7e3818aeba3abed98a7e3818ae29787
UHC 룵첂◈룵₃룵ぁ悠お룫日お룫혧お룫혧お◇ 1000111110101010101010101000111110100010110000101000111110101010101010011111110110001111101010101010101010100001111010101110110110101010101010101000111110100010111011001110110110101010101010101000111110100010110000101000111110101010101010101000111110100010110000101000111110101010101010101010000111011110 8faaaa8fa2c28faaa9fd8faaaaa1eaedaaaa8fa2ecedaaaa8fa2c28faaaa8fa2c28faaaaa1de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)