What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????幼??嚥▲?誼?????肉ε? 11100001100111110011111100111111001111110011111100111111100101110110001100111111001111111001101010001011100000011010001100111111100010110110001000111111001111110011111100111111001111111001001111110111100000111100001100111111 e19f3f3f3f3f3f97633f3f9a8b81a33f8b623f3f3f3f3f93f783c33f
EUC-JP 癲?????幼??嚥▲?誼??洹??肉ε? 111000101010000100111111001111110011111100111111001111111100110111000100001111110011111111010011111010111010001010100101001111111011010111000011001111110011111110001111110001111011101000111111001111111100011011111001101001101100010100111111 e2a13f3f3f3f3fcdc43f3fd3eba2a53fb5c33f3f8fc7ba3f3fc6f9a6c53f
UTF-8 癲앷쑬梨뤄쭎幼먯춷嚥▲꺃誼띷넼洹귣봿肉ε껙 1110011110011001101100101110110010010101101101111110110010010001101011001110111110100111101000101110101110100100100001001110110010101101100011101110010110111001101111001110101110101000101011111110110010110110101101111110010110011010101001011110001010010110101100101110101010111010100000111110100010101010101111001110101110011101101101111110101110000100101111001110011010110100101110011110101010110111101000111110101110110100101111111110100010000010100010011100111010110101111010101011101110011001 e799b2ec95b7ec91acefa7a2eba484ecad8ee5b9bceba8afecb6b7e59aa5e296b2eaba83e8aabceb9db7eb84bce6b4b9eab7a3ebb4bfe88289ceb5eabb99
UHC 癲앷쑬梨뤄쭎幼먯춷嚥▲꺃誼띷넼洹귣봿肉ε껙 111011111010011010011101111010101011111010101000111011001011000110110111111011111010011110000111111010101110101010010000111011001010110110010011111001101011111110100001111000111000001110101100111010111111111010001101111001101000011010110110111010101011011110000010111010111001010010000110111010111011111110100101111001011011001010110011 efa69deabea8ecb1b7efa787eaea90ecad93e6bfa1e383acebfe8de686b6eab782eb9486ebbfa5e5b2b3

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
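
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page: the mapping from the charset names in the table to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `to_bit_strings` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset name: (binary, hex)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

For example, `to_bit_strings("あ", "utf-8")` gives the hexadecimal string `e38182`, while the same character in ISO-8859-1 comes out as `3f` because that charset has no hiragana.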