To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???^h???^fN}???^h???^fN{^ 00111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111110100111111001111110011111101011110011010000011111100111111001111110101111001100110010011100111101101011110 3f3f3f5e683f3f3f5e664e7d3f3f3f5e683f3f3f5e664e7b5e
SJIS-WIN 荼暑砂^h荼暑砂^fN}荼暑砂^h荼暑砂^fN{^ 11100100101101101000111110001011100011011011101101011110011010001110010010110110100011111000101110001101101110110101111001100110010011100111110111100100101101101000111110001011100011011011101101011110011010001110010010110110100011111000101110001101101110110101111001100110010011100111101101011110 e4b68f8b8dbb5e68e4b68f8b8dbb5e664e7de4b68f8b8dbb5e68e4b68f8b8dbb5e664e7b5e
EUC-JP 荼暑砂^h荼暑砂^fN}荼暑砂^h荼暑砂^fN{^ 11101000101110001011110111101011101110101011110101011110011010001110100010111000101111011110101110111010101111010101111001100110010011100111110111101000101110001011110111101011101110101011110101011110011010001110100010111000101111011110101110111010101111010101111001100110010011100111101101011110 e8b8bdebbabd5e68e8b8bdebbabd5e664e7de8b8bdebbabd5e68e8b8bdebbabd5e664e7b5e
UTF-8 荼暑砂^h荼暑砂^fN}荼暑砂^h荼暑砂^fN{^ 11101000100011011011110011100110100110101001000111100111101000001000001001011110011010001110100010001101101111001110011010011010100100011110011110100000100000100101111001100110010011100111110111101000100011011011110011100110100110101001000111100111101000001000001001011110011010001110100010001101101111001110011010011010100100011110011110100000100000100101111001100110010011100111101101011110 e88dbce69a91e7a0825e68e88dbce69a91e7a0825e664e7de88dbce69a91e7a0825e68e88dbce69a91e7a0825e664e7b5e
UHC ?暑砂^h?暑砂^fN}?暑砂^h?暑砂^fN{^ 001111111101111111110100110111101110001101011110011010000011111111011111111101001101111011100011010111100110011001001110011111010011111111011111111101001101111011100011010111100110100000111111110111111111010011011110111000110101111001100110010011100111101101011110 3fdff4dee35e683fdff4dee35e664e7d3fdff4dee35e683fdff4dee35e664e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)