To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 役??椅??儒??B 10010110111100000011111100111111100010001101011000111111001111111000111011110010001111110011111101000010 96f03f3f88d63f3f8ef23f3f42
EUC-JP 役??椅??儒??B 11001100111100100011111100111111101100001101100000111111001111111011110011110100001111110011111101000010 ccf23f3fb0d83f3fbcf43f3f42
UTF-8 役대끋椅ⓩ뎬儒쎌뿨B 11100101101111011011100111101011100011001000000011101011100000011000101111100110101001001000010111100010100100111010100111101011100011101010110011100101100001001001001011101100100011101000110011101011101111111010100001000010 e5bdb9eb8c80eb818be6a485e293a9eb8eace58492ec8e8cebbfa842
UHC 役대끋椅ⓩ뎬儒쎌뿨B 11100110101101011011010011101011100001011011110111101011111101011010100011100110101101011011010011101010111000111011110111101100100101111010100001000010 e6b5b4eb85bdebf5a8e6b5b4eae3bdec97a842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)