What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 汚?????節℡?Lh汚?????節℡?L 100010011001100000111111001111110011111100111111001111111001000011011111100001111000010000111111010011000110100010001001100110000011111100111111001111110011111100111111100100001101111110000111100001000011111101001100 89983f3f3f3f3f90df87843f4c6889983f3f3f3f3f90df87843f4c
EUC-JP 汚?????節??Lh汚?????節??L 10110001111110000011111100111111001111110011111100111111110000001110000100111111001111110100110001101000101100011111100000111111001111110011111100111111001111111100000011100001001111110011111101001100 b1f83f3f3f3f3fc0e13f3f4c68b1f83f3f3f3f3fc0e13f3f4c
UTF-8 汚억슬樂됮줁節℡맆Lh汚억슬樂됮줁節℡맆L 111001101011000110011010111011001001011010110101111011001000101010101100111011111010011010111111111010111001000010101110111011001010010010000001111001111010111110000000111000101000010010100001111010111010011110000110010011000110100011100110101100011001101011101100100101101011010111101100100010101010110011101111101001101011111111101011100100001010111011101100101001001000000111100111101011111000000011100010100001001010000111101011101001111000011001001100 e6b19aec96b5ec8aacefa6bfeb90aeeca481e7af80e284a1eba7864c68e6b19aec96b5ec8aacefa6bfeb90aeeca481e7af80e284a1eba7864c
UHC 汚억슬樂됮줁節℡맆Lh汚억슬樂됮줁節℡맆L 111001111111110110111110111011111011110110111101111010001111100110001001111010011010000110011000111011111011110110100010111001011001000010100000010011000110100011100111111111011011111011101111101111011011110111101000111110011000100111101001101000011001100011101111101111011010001011100101100100001010000001001100 e7fdbeefbdbde8f989e9a198efbda2e590a04c68e7fdbeefbdbde8f989e9a198efbda2e590a04c

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (byte 3f).
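
The conversion shown in the table can be approximated with a minimal Python sketch. It is not the code behind this page. The mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the names CHARSETS and encode_row. Replacing unencodable characters with "?" is also an assumption, chosen to match the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row("汚Lh", codec)
        print(label, bits, hexed)
```

Calling encode_row("Lh", "iso-8859-1") gives the hex string 4c68, which matches the trailing "Lh" bytes in every row of the table. Results for other codecs may differ from the table where Python's codec tables differ from those the page uses.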