What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 霄ーセト鞁サ質}霄ーセト鞁サ質{^ 11101000101110101011000011110001100011101011111011000100111010001101100111110011111011001011101110001110101111110111110111101000101110101011000011110001100011101011111011000100111010001101100111110011111011001011101110001110101111110111101101011110 e8bab0f18ebec4e8d9f3ecbb8ebf7de8bab0f18ebec4e8d9f3ecbb8ebf7b5e
EUC-JP 霄ー?セト鞁?サ質}霄ー?セト鞁?サ質{^ 1111000010111100100011101011000000111111100011101011111010001110110001001111000011011011001111111000111010111011101111001100000101111101111100001011110010001110101100000011111110001110101111101000111011000100111100001101101100111111100011101011101110111100110000010111101101011110 f0bc8eb03f8ebe8ec4f0db3f8ebbbcc17df0bc8eb03f8ebe8ec4f0db3f8ebbbcc17b5e
UTF-8 霄ーセト鞁サ質}霄ーセト鞁サ質{^ 111010011001110010000100111011111011110110110000111011101000010010001001111011111011110110111110111011111011111010000100111010011001111010000001111011101000101110011111111011111011110110111011111010001011001110101010011111011110100110011100100001001110111110111101101100001110111010000100100010011110111110111101101111101110111110111110100001001110100110011110100000011110111010001011100111111110111110111101101110111110100010110011101010100111101101011110 e99c84efbdb0ee8489efbdbeefbe84e99e81ee8b9fefbdbbe8b3aa7de99c84efbdb0ee8489efbdbeefbe84e99e81ee8b9fefbdbbe8b3aa7b5e
UHC ????????質}????????質{^ 0011111100111111001111110011111100111111001111110011111100111111111100101111010101111101001111110011111100111111001111110011111100111111001111110011111111110010111101010111101101011110 3f3f3f3f3f3f3f3ff2f57d3f3f3f3f3f3f3f3ff2f57b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
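
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_in_charsets` and the mapping from the table's charset labels to Python codec names are assumptions (in particular, UHC is assumed to correspond to Python's `cp949`). Characters that a charset cannot represent are replaced with "?", which matches the 3f bytes that appear in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# SJIS-WIN -> cp932 follows the note above; UHC -> cp949 is an assumption.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in encode_in_charsets("{^").items():
        print(label, binary, hexa)
    # Every row prints 0111101101011110 7b5e, since "{" and "^" are ASCII.
```

Because "{" and "^" are ASCII characters, they encode to the same two bytes (7b 5e) in all five charsets, which is why every row of the table ends in 7b5e.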