To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???飮??瑜??? 001111110011111100111111100111110101101000111111001111111110000011101111001111110011111100111111 3f3f3f9f5a3f3fe0ef3f3f3f
EUC-JP 艅??飮??瑜??? 1000111111010110111111010011111100111111110111011011101100111111001111111110000011110001001111110011111100111111 8fd6fd3f3fddbb3f3fe0f13f3f3f
UTF-8 艅덈엪飮꿨죰瑜낆돟嶺 111010001000100110000101111010111000110110001000111011001001011110101010111010011010001110101110111010101011111110101000111011001010001110110000111001111001000110011100111010111000001010000110111010111000111110011111111011111010011010101011 e88985eb8d88ec97aae9a3aeeabfa8eca3b0e7919ceb8286eb8f9fefa6ab
UHC 艅덈엪飮꿨죰瑜낆돟嶺 1110011010101001100010001110101110011110100000111110101111100110101100101110010110100001100010111110101110100101100001011110110010001001101001011110011110101101 e6a988eb9e83ebe6b2e5a18beba585ec89a5e7ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)