To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 訒夭貊カ頷誹貊ケ蓆豈訒夲スカ髣費スケ蜿アB 11111011101000111001101011101110111001101011101110110110111010001111010110010100111011101110011010111011101110011110010011101100111001101010111111111011101000111001101011101111101111011011011011101001100101111001010011101111101111011011100111100101100011111011000101000010 fba39aeee6bbb6e8f594eee6bbb9e4ece6affba39aefbdb6e99794efbdb9e58fb142
EUC-JP 訒夭貊カ頷誹貊ケ蓆豈訒夲スカ髣費スケ蜿アB 10001111110111011100100011010100111100001110110010111101100011101011011011110000111101111100100011110000111011001011110110001110101110011110100011101110111011001011000110001111110111011100100011010100111100011000111010111101100011101011011011110001111101111100100011110001100011101011110110001110101110011110100111101111100011101011000101000010 8fddc8d4f0ecbd8eb6f0f7c8f0ecbd8eb9e8eeecb18fddc8d4f18ebd8eb6f1f7c8f18ebd8eb9e9ef8eb142
UTF-8 訒夭貊カ頷誹貊ケ蓆豈訒夲スカ髣費スケ蜿アB 11101000101010001001001011100101101001001010110111101000101100101000101011101111101111011011011011101001101000001011011111101000101010101011100111101000101100101000101011101111101111011011100111101000100100111000011011101000101100011000100011101000101010001001001011100101101001001011001011101111101111011011110111101111101111011011011011101001101010111010001111101000101100101011101111101111101111011011110111101111101111011011100111101000100111001011111111101111101111011011000101000010 e8a892e5a4ade8b28aefbdb6e9a0b7e8aab9e8b28aefbdb9e89386e8b188e8a892e5a4b2efbdbdefbdb6e9aba3e8b2bbefbdbdefbdb9e89cbfefbdb142
UHC ?夭貊??誹貊?蓆豈?????費????B 00111111111010001110110011011000111001110011111100111111110111101010011011011000111001110011111111100000101101101101000111000010001111110011111100111111001111110011111111011110101010000011111100111111001111110011111101000010 3fe8ecd8e73f3fdea6d8e73fe0b6d1c23f3f3f3f3fdea83f3f3f3f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
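
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example UHC to `cp949`, SJIS-WIN to `cp932`) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 row above. Other implementations may map some characters differently, so results are not guaranteed to match the table byte for byte.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For the single letter "B", every charset listed yields the same byte, 01000010 (hex 42), which matches the last byte of each row in the table.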