To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰??蹂〓?域??瑜??兢意?? 111000011001111100111111001111111110000010100111001111110011111111100110111110001000000110101100001111111000100011100110001111110011111111100000111011110011111100111111100110010101110110001000110100110011111100111111 e19f3f3fe0a73f3fe6f881ac3f88e63f3fe0ef3f3f995d88d33f3f
EUC-JP 癲??爰??蹂〓?域??瑜??兢意?? 111000101010000100111111001111111110000010101001001111110011111111101100111110101010001010101110001111111011000011101000001111110011111111100000111100010011111100111111110100011011111010110000110101010011111100111111 e2a13f3fe0a93f3fecfaa2ae3fb0e83f3fe0f13f3fd1beb0d53f3f
UTF-8 癲ㅺ퓭爰귝끽蹂〓븶域뱀룇瑜쇔슫兢意㎬굜 111001111001100110110010111000111000010110111010111011011001001110101101111001111000100010110000111010101011011110011101111010111000000110111101111010001011100110000010111000111000000010010011111010111011100010110110111001011001111110011111111010111011000110000000111010111010001110000111111001111001000110011100111011001000011110010100111011001000101010101011111001011000010110100010111001101000010010001111111000111000111010101100111010101011010110011100 e799b2e385baed93ade788b0eab79deb81bde8b982e38093ebb8b6e59f9febb180eba387e7919cec8794ec8aabe585a2e6848fe38eaceab59c
UHC 癲ㅺ퓭爰귝끽蹂〓븶域뱀룇瑜쇔슫兢意㎬굜 1110111110100110101001001110101010111111100101001110101010111010100000101110011010110011101000111110101110110011101000011110101110010101100111111110011010110100101110011110110010001111100001101110101110100101101111001110010110011010101101001101000011100111111010111111001010100111111010001000001010000100 efa6a4eabf94eaba82e6b3a3ebb3a1eb959fe6b4b9ec8f86eba5bce59ab4d0e7ebf2a7e88284

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)