To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??巽ζ?音??域??一??蹂λ?沃 11100001100111110011111100111111100100100100011010000011110001000011111110001001101110010011111100111111100010001110011000111111001111111000100011101010001111110011111111100110111110001000001111001001001111111001011110000000 e19f3f3f924683c43f89b93f3f88e63f3f88ea3f3fe6f883c93f9780
EUC-JP 癲??巽ζ?音??域??一??蹂λ?沃 11100010101000010011111100111111110000111010011110100110110001100011111110110010101110110011111100111111101100001110100000111111001111111011000011101100001111110011111111101100111110101010011011001011001111111100110111100000 e2a13f3fc3a7a6c63fb2bb3f3fb0e83f3fb0ec3f3fecfaa6cb3fcde0
UTF-8 癲딅끂巽ζ만音ㅻ눀域㏓쓹一뽪찄蹂λ룂沃 11100111100110011011001011101011100101001000010111101011100000011000001011100101101101111011110111001110101101101110101110100111100011001110100110011111101100111110001110000101101110111110101110001000100000001110010110011111100111111110001110001111100100111110110010010011101110011110010010111000100000001110101110111101101010101110110010110000100001001110100010111001100000101100111010111011111010111010001110000010111001101011001010000011 e799b2eb9485eb8182e5b7bdceb6eba78ce99fb3e385bbeb8880e59f9fe38f93ec93b9e4b880ebbdaaecb084e8b982cebbeba382e6b283
UHC 癲딅끂巽ζ만音ㅻ눀域㏓쓹一뽪찄蹂λ룂沃 1110111110100110100010101110101110000101101110001110000111011110101001011110011010111000101110001110101111100101101001001110101110000111101000011110011010110100101001111110101110011101100101011110110011101001100101101110011010101001100010001110101110110011101001011110101110001111100000111110100010101010 efa68aeb85b8e1dea5e6b8b8ebe5a4eb87a1e6b4a7eb9d95ece996e6a988ebb3a5eb8f83e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)