What bit string is a character (or characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????M??????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011010011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN 雍夜㈲謔夐賢蜊ィ隨ケM雍夜㈲謔夐賢蜊ィ隨ケ\ 1110100010110100100101101110100110000111100010111110011010000010100110101110100110001100101010111110010110001101101010001110011110101100101110010100110111101000101101001001011011101001100001111000101111100110100000101001101011101001100011001010101111100101100011011010100011100111101011001011100101011100 e8b496e9878be6829ae98cabe58da8e7acb94de8b496e9878be6829ae98cabe58da8e7acb95c
EUC-JP 雍夜?謔夐賢蜊ィ隨ケM雍夜?謔夐賢蜊ィ隨ケ\ 11110000101101101100110011101011001111111110101111100010110101001110101110111000101011011110100111101101100011101010100011101110101011101000111010111001010011011111000010110110110011001110101100111111111010111110001011010100111010111011100010101101111010011110110110001110101010001110111010101110100011101011100101011100 f0b6cceb3febe2d4ebb8ade9ed8ea8eeae8eb94df0b6cceb3febe2d4ebb8ade9ed8ea8eeae8eb95c
UTF-8 雍夜㈲謔夐賢蜊ィ隨ケM雍夜㈲謔夐賢蜊ィ隨ケ\ 1110100110011011100011011110010110100100100111001110001110001000101100101110100010101100100101001110010110100100100100001110100010110011101000101110100010011100100010101110111110111101101010001110100110011010101010001110111110111101101110010100110111101001100110111000110111100101101001001001110011100011100010001011001011101000101011001001010011100101101001001001000011101000101100111010001011101000100111001000101011101111101111011010100011101001100110101010100011101111101111011011100101011100 e99b8de5a49ce388b2e8ac94e5a490e8b3a2e89c8aefbda8e99aa8efbdb94de99b8de5a49ce388b2e8ac94e5a490e8b3a2e89c8aefbda8e99aa8efbdb95c
UHC 雍夜?謔?賢??隨?M雍夜?謔?賢??隨?\ 1110100010111100111001011010100000111111111110011100110000111111111110101110011100111111001111111110001011001011001111110100110111101000101111001110010110101000001111111111100111001100001111111111101011100111001111110011111111100010110010110011111101011100 e8bce5a83ff9cc3ffae73f3fe2cb3f4de8bce5a83ff9cc3ffae73f3fe2cb3f5c

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
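
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the ISO-8859-1 and UHC rows. Python's codecs may differ from this tool's converter for some vendor-specific characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    # Hypothetical sample input: one hiragana letter plus two ASCII characters.
    for label, binary, hexadecimal in encode_table("あM\\"):
        print(label, binary, hexadecimal)
```

The ASCII characters "M" and "\" come out as 4d and 5c in every charset listed here, which is why each row in the table ends in 5c and contains a 4d in the middle.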