To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 第頃??棕頃??琮??敎??甑頃??繒?孺 10010001111001101000110110100000001111110011111110011110101000011000110110100000001111110011111111111011011010100011111100111111111110101100110100111111001111111000110110011001100011011010000000111111001111111111101110001111001111111001101101111101 91e68da03f3f9ea18da03f3ffb6a3f3ffacd3f3f8d998da03f3ffb8f3f9b7d
EUC-JP 第頃??棕頃??琮?邕???甑頃??繒?孺 11000010111010001011101010100010001111110011111111011100101000111011101010100010001111110011111110001111110011001011001000111111100011111110000111101101001111110011111100111111101110011111100110111010101000100011111100111111100011111101010011010100001111111101010111011110 c2e8baa23f3fdca3baa23f3f8fccb23f8fe1ed3f3f3fb9f9baa23f3f8fd4d43fd5de
UTF-8 第頃렰렫棕頃렰렔琮렗邕敎렢렕甑頃렰렔繒렗孺 111001111010110010101100111010011010000010000011111010111010000010110000111010111010000010101011111001101010001110010101111010011010000010000011111010111010000010110000111010111010000010010100111001111001000010101110111010111010000010010111111010011000001010010101111001101001010110001110111010111010000010100010111010111010000010010101111001111001010010010001111010011010000010000011111010111010000010110000111010111010000010010100111001111011100110010010111010111010000010010111111001011010110110111010 e7acace9a083eba0b0eba0abe6a395e9a083eba0b0eba094e790aeeba097e98295e6958eeba0a2eba095e79491e9a083eba0b0eba094e7b992eba097e5adba
UHC 第頃렰렫棕頃렰렔琮렗邕敎렢렕甑頃렰렔繒렗孺 111100001010111111001100111100011000111010111101100011101011100111110000111101111100110011110001100011101011110110001110101010011111000011111001100011101010110011101000101110111100111011100111100011101011001110001110101010101111000111110111110011001111000110001110101111011000111010101001111100011111100110001110101011001110101011101000 f0afccf18ebd8eb9f0f7ccf18ebd8ea9f0f98eace8bbcee78eb38eaaf1f7ccf18ebd8ea9f1f98eaceae8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)