To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厓れ?????у????厓れ?????у????^ 11111010100011011000001011101010001111110011111100111111001111110011111110000100100001010011111100111111001111110011111111111010100011011000001011101010001111110011111100111111001111110011111110000100100001010011111100111111001111110011111101011110 fa8d82ea3f3f3f3f3f84853f3f3f3ffa8d82ea3f3f3f3f3f84853f3f3f3f5e
EUC-JP 厓れ?????у????厓れ?????у????^ 100011111011010011000111101001001110110000111111001111110011111100111111001111111010011111100101001111110011111100111111001111111000111110110100110001111010010011101100001111110011111100111111001111110011111110100111111001010011111100111111001111110011111101011110 8fb4c7a4ec3f3f3f3f3fa7e53f3f3f3f8fb4c7a4ec3f3f3f3f3fa7e53f3f3f3f5e
UTF-8 厓れ웾溜곕젲寧у텩溜곕젌厓れ웾溜곕젲寧у텩溜곕젌^ 1110010110001110100100111110001110000010100011001110110010011011101111101110111110100111100010111110101010110011100101011110110010100000101100101110111110100110101010101101000110000011111011011000010110101001111011111010011110001011111010101011001110010101111011001010000010001100111001011000111010010011111000111000001010001100111011001001101110111110111011111010011110001011111010101011001110010101111011001010000010110010111011111010011010101010110100011000001111101101100001011010100111101111101001111000101111101010101100111001010111101100101000001000110001011110 e58e93e3828cec9bbeefa78beab395eca0b2efa6aad183ed85a9efa78beab395eca08ce58e93e3828cec9bbeefa78beab395eca0b2efa6aad183ed85a9efa78beab395eca08c5e
UHC 厓れ웾溜곕젲寧у텩溜곕젌厓れ웾溜곕젲寧у텩溜곕젌^ 11100100111011011010101011101100100111111000100111101010111111101011000011101011101000001010011011100111101011001010110011100101101101101001110111101010111111101011000011101011101000001000110111100100111011011010101011101100100111111000100111101010111111101011000011101011101000001010011011100111101011001010110011100101101101101001110111101010111111101011000011101011101000001000110101011110 e4edaaec9f89eafeb0eba0a6e7acace5b69deafeb0eba08de4edaaec9f89eafeb0eba0a6e7acace5b69deafeb0eba08d5e

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
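
Rows like those in the table can be reproduced roughly with the minimal Python sketch below. It is not the converter's own implementation. The codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are Python's nearest equivalents and are an assumption, so results may differ from the table for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table.

```python
# Hypothetical mapping from the charset labels in the table to Python codec
# names. These are assumed equivalents, not the names the converter uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with b'?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "れ^"  # two characters that also appear in the table's input
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

For example, encode_row("れ", "utf-8") yields the hex string e3828c, the same bytes that appear for that character in the UTF-8 row, while ISO-8859-1 cannot represent it and falls back to 3f.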