What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?遐荊????げ遼B 0011111111100111101000001000110001110100001111110011111100111111001111111000001010110000100101111100100101000010 3fe7a08c743f3f3f3f82b097c942
EUC-JP ?遐荊????げ遼B 0011111111101110101000101011011111010101001111110011111100111111001111111010010010110010110011101100101101000010 3feea2b7d53f3f3f3fa4b2cecb42
UTF-8 뤋遐荊꾕죴샘폼げ遼B 11101011101001001000101111101001100000011001000011101000100011011000101011101010101111101001010111101100101000111011010011101100100000111001100011101101100011111011110011100011100000011001001011101001100000011011110001000010 eba48be98190e88d8aeabe95eca3b4ec8398ed8fbce38192e981bc42
UHC 뤋遐荊꾕죴샘폼げ遼B 10001111101110111111100111000110111110111010101010110010110101111010000110001111101110111111100111000110111110111010101010110010110101111010000101000010 8fbbf9c6fbaab2d7a18fbbf9c6fbaab2d7a142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
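
The kind of table shown above can be approximated offline with a short Python sketch. This is not the converter's actual implementation, which is not shown on this page. The codec names are assumptions about how the table's labels map onto Python's standard codecs: SJIS-WIN is taken to be `cp932` (per the note above) and UHC is taken to be `cp949`. Characters a charset cannot represent are replaced with `?`, which matches the `3f` bytes seen in the table. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical and exist only for this illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (byte 0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    shown = data.decode(codec)                    # what survives the round trip
    bits = "".join(f"{b:08b}" for b in data)      # 8 bits per byte, zero-padded
    return shown, bits, data.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    print("Charset Character Bit string (binary) Bit string (hexadecimal)")
    for label, (shown, bits, hexstr) in encode_table("げB").items():
        print(label, shown, bits, hexstr)
```

For example, `encode_row("げB", "iso-8859-1")` yields `?B` with hex `3f42`, because ISO-8859-1 has no hiragana, while `encode_row("げ", "utf-8")` yields the three-byte sequence `e38192`, as in the UTF-8 row above.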