To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????h?????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??怨烽∧??疑??h??怨烽∧??疑?? 0011111100111111100010011000010111100000100000101000000111001000001111110011111110001011010111100011111100111111011010000011111100111111100010011000010111100000100000101000000111001000001111110011111110001011010111100011111100111111 3f3f8985e08281c83f3f8b5e3f3f683f3f8985e08281c83f3f8b5e3f3f
EUC-JP ??怨烽∧??疑??h??怨烽∧??疑?? 0011111100111111101100011110010111011111111000101010001011001010001111110011111110110101101111110011111100111111011010000011111100111111101100011110010111011111111000101010001011001010001111110011111110110101101111110011111100111111 3f3fb1e5dfe2a2ca3f3fb5bf3f3f683f3fb1e5dfe2a2ca3f3fb5bf3f3f
UTF-8 欌렪怨烽∧欌렪疑양뱌h欌렪怨烽∧欌렪疑양뱌 11100110101011001000110011101011101000001010101011100110100000001010100011100111100000111011110111100010100010001010011111100110101011001000110011101011101000001010101011100111100101101001000111101100100101101001000111101011101100011000110001101000111001101010110010001100111010111010000010101010111001101000000010101000111001111000001110111101111000101000100010100111111001101010110010001100111010111010000010101010111001111001011010010001111011001001011010010001111010111011000110001100 e6ac8ceba0aae680a8e783bde288a7e6ac8ceba0aae79691ec9691ebb18c68e6ac8ceba0aae680a8e783bde288a7e6ac8ceba0aae79691ec9691ebb18c
UHC 欌렪怨烽∧欌렪疑양뱌h欌렪怨烽∧欌렪疑양뱌 1110110111101011100011101011100011101010101100111101110011101011101000011111110011101101111010111000111010111000111010111111011110111110111001111011100111110010011010001110110111101011100011101011100011101010101100111101110011101011101000011111110011101101111010111000111010111000111010111111011110111110111001111011100111110010 edeb8eb8eab3dceba1fcedeb8eb8ebf7bee7b9f268edeb8eb8eab3dceba1fcedeb8eb8ebf7bee7b9f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)