To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?怨棒?寃???怨棒?寃? 111000110111000100111111100010011000010110010110010111110011111110011011100000110011111100111111001111111000100110000101100101100101111100111111100110111000001100111111 e3713f8985965f3f9b833f3f3f8985965f3f9b833f
EUC-JP 縡?怨棒?寃???怨棒?寃? 111001011101001000111111101100011110010111001011110000000011111111010101111000110011111100111111001111111011000111100101110010111100000000111111110101011110001100111111 e5d23fb1e5cbc03fd5e33f3f3fb1e5cbc03fd5e33f
UTF-8 縡렕怨棒㉢寃닻렖렕怨棒㉢寃냄 111001111011100010100001111010111010000010010101111001101000000010101000111001101010001110010010111000111000100110100010111001011010111110000011111010111000101110111011111010111010000010010110111010111010000010010101111001101000000010101000111001101010001110010010111000111000100110100010111001011010111110000011111010111000001110000100 e7b8a1eba095e680a8e6a392e389a2e5af83eb8bbbeba096eba095e680a8e6a392e389a2e5af83eb8384
UHC 縡렕怨棒㉢寃닻렖렕怨棒㉢寃냄 11101110101011011000111010101010111010101011001111011100111010101010100010110011111010101011001010110100111010011000111010101011100011101010101011101010101100111101110011101010101010001011001111101010101100101011001110111111 eead8eaaeab3dceaa8b3eab2b4e98eab8eaaeab3dceaa8b3eab2b3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)