To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????㎝????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000011101110000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f87703f3f3f3f3f3f3f3f3f
EUC-JP ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 梨숈㏈吏쵡梨좏삺吏앹콡吏좎㎝梨숈㏏吏몄콡吏쇱쮫 111011111010011110100010111011001000100010001000111000111000111110001000111011111010011110011110111011001011010110100001111011111010011110100010111011001010001010001111111011001000001010111010111011111010011110011110111011001001010110111001111011001011110110100001111011111010011110011110111011001010001010001110111000111000111010011101111011111010011110100010111011001000100010001000111000111000111110001111111011111010011110011110111010111010101010000100111011001011110110100001111011111010011110011110111011001000011110110001111011001010111010101011 efa7a2ec8888e38f88efa79eecb5a1efa7a2eca28fec82baefa79eec95b9ecbda1efa79eeca28ee38e9defa7a2ec8888e38f8fefa79eebaa84ecbda1efa79eec87b1ecaeab
UHC 梨숈㏈吏쵡梨좏삺吏앹콡吏좎㎝梨숈㏏吏몄콡吏쇱쮫 11101100101100011001100111101100101001111011110011101100101001111010110101000001111011001011000110100000111011011001100010110001111011001010011110011101111011001011000110011001111011001010011110100000111011001010011110101111111011001011000110011001111011001010011110111001111011001010011110111000111011001011000110011001111011001010011110111100111011001010100010001000 ecb199eca7bceca7ad41ecb1a0ed98b1eca79decb199eca7a0eca7afecb199eca7b9eca7b8ecb199eca7bceca888

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)