To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ?舅???舅??肌 001111111110010001101110001111110011111100111111111001000110111000111111001111111001010010100111 3fe46e3f3f3fe46e3f3f94a7
EUC-JP ?舅???舅??肌 001111111110011111001111001111110011111100111111111001111100111100111111001111111100100010101001 3fe7cf3f3f3fe7cf3f3fc8a9
UTF-8 렺舅렺후爐舅렺후肌 111010111010000010111010111010001000100010000101111010111010000010111010111011011001101110000100111011111010010010110010111010001000100010000101111010111010000010111010111011011001101110000100111010001000001010001100 eba0bae88885eba0baed9b84efa4b2e88885eba0baed9b84e8828c
UHC 렺舅렺후爐舅렺후肌 100011101100001011001111110000001000111011000010110010001100010011010010110001001100111111000000100011101100001011001000110001001101000110111111 8ec2cfc08ec2c8c4d2c4cfc08ec2c8c4d1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)