What bit string does a character (or string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8意??袁μ?億???域?????猷??裔 1110000110011111001111111000001001010111100010001101001100111111001111111110010111001101100000111100101000111111100010011010110100111111001111110011111110001000111001100011111100111111001111110011111100111111100101110101000100111111001111111110010111100001 e19f3f825788d33f3fe5cd83ca3f89ad3f3f3f88e63f3f3f3f3f97513f3fe5e1
EUC-JP 癲?8意??袁μ?億???域??馹??猷??裔 11100010101000010011111110100011101110001011000011010101001111110011111111101010110011111010011011001100001111111011001010101111001111110011111100111111101100001110100000111111001111111000111111101001101000010011111100111111110011011011001000111111001111111110101011100011 e2a13fa3b8b0d53f3feacfa6cc3fb2af3f3f3fb0e83f3f8fe9a13f3fcdb23f3feae3
UTF-8 癲쒕8意덅굢袁μ칰億왁룸쇀域㏃쥓馹숋쬅猷붽틙裔 1110011110011001101100101110110010010010100101011110111110111100100110001110011010000100100011111110101110001101100001011110101010110101101000101110100010100010100000011100111010111100111011001011100110110000111001011000010010000100111011001001100110000001111010111010001110111000111011001000011110000000111001011001111110011111111000111000111110000011111011001010010110010011111010011010011010111001111011001000100010001011111011001010110010000101111001111000110010110111111010111011011010111101111011011000101110011001111010001010001110010100 e799b2ec9295efbc98e6848feb8d85eab5a2e8a281cebcecb9b0e58484ec9981eba3b8ec8780e59f9fe38f83eca593e9a6b9ec888becac85e78cb7ebb6bded8b99e8a394
UHC 癲쒕8意덅굢袁μ칰億왁룸쇀域㏃쥓馹숋쬅猷붽틙裔 11101111101001101001110011101011101000111011100011101011111100101000100011101000100000101000100111101010101111101010010111101100101011111000001111100101111000101011111111001110101101111110101110011001101101001110011010110100101001111110110010100010100010101110110011110001100110011110111110100110100111001110101110100011100101001110101010111010100001101110011111100000 efa69ceba3b8ebf288e88289eabea5ecaf83e5e2bfceb7eb99b4e6b4a7eca28aecf199efa69ceba394eaba86e7e0

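SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is replaced with "?" (0x3f), which is why the ISO-8859-1 row consists entirely of 3f bytes.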
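
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above. Their mapping tables may differ from this tool's in a few edge cases. Unencodable characters are replaced with "?" (0x3f), matching the behaviour visible in the table.

```python
# Assumed Python codec equivalents for the charsets named in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.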