To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???唯??轅??筌 00111111001111110011111110010111010000100011111100111111111001110111011000111111001111111110001010100011 3f3f3f97423f3fe7763f3fe2a3
EUC-JP 艅??唯??轅??筌 100011111101011011111101001111110011111111001101101000110011111100111111111011011101011100111111001111111110010010100101 8fd6fd3f3fcda33f3fedd73f3fe4a5
UTF-8 艅덈냱唯껆깗轅깅닀筌 111010001000100110000101111010111000110110001000111010111000001110110001111001011001010010101111111010101011101110000110111010101011100110010111111010001011110110000101111010101011100110000101111010111000101110000000111001111010110110001100 e88985eb8d88eb83b1e594afeabb86eab997e8bd85eab985eb8b80e7ad8c
UHC 艅덈냱唯껆깗轅깅닀筌 1110011010101001100010001110101110000110100000011110101011100110100000111110011110000011100011111110101010111111101100011110101110001000100010011110111110100111 e6a988eb8681eae683e7838feabfb1eb8889efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)