To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D\ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011100 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445c
SJIS-WIN ?脹??脹????D?脹??脹????D\ 00111111100100101010111100111111001111111001001010101111001111110011111100111111001111110100010000111111100100101010111100111111001111111001001010101111001111110011111100111111001111110100010001011100 3f92af3f3f92af3f3f3f3f443f92af3f3f92af3f3f3f3f445c
EUC-JP ?脹??脹????D?脹??脹????D\ 00111111110001001011000100111111001111111100010010110001001111110011111100111111001111110100010000111111110001001011000100111111001111111100010010110001001111110011111100111111001111110100010001011100 3fc4b13f3fc4b13f3f3f3f443fc4b13f3fc4b13f3f3f3f445c
UTF-8 뤋脹큌뤋脹콓샘ㅿ龜D뤋脹큌뤋脹콓샘ㅿ龜D\ 111010111010010010001011111010001000010010111001111011011000000110001100111010111010010010001011111010001000010010111001111011001011110110010011111011001000001110011000111000111000010110111111111011111010010010001000010001001110101110100100100010111110100010000100101110011110110110000001100011001110101110100100100010111110100010000100101110011110110010111101100100111110110010000011100110001110001110000101101111111110111110100100100010000100010001011100 eba48be884b9ed818ceba48be884b9ecbd93ec8398e385bfefa48844eba48be884b9ed818ceba48be884b9ecbd93ec8398e385bfefa488445c
UHC 뤋脹큌뤋脹콓샘ㅿ龜D뤋脹큌뤋脹콓샘ㅿ龜D\ 100011111011101111110011111011001011010001010111100011111011101111110011111011001011000110001111101110111111100110100100111011111101000010111000010001001000111110111011111100111110110010110100010101111000111110111011111100111110110010110001100011111011101111111001101001001110111111010000101110000100010001011100 8fbbf3ecb4578fbbf3ecb18fbbf9a4efd0b8448fbbf3ecb4578fbbf3ecb18fbbf9a4efd0b8445c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)