What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 橈??梧?=???腰ч?節??鶯??要?━B 10011110111101000011111100111111100011001110011000111111100000011000000100111111001111110011111110001101100110001000010010001001001111111001000011011111001111110011111111101001111100100011111100111111100101110111011000111111100001001010101001000010 9ef43f3f8ce63f81813f3f3f8d9884893f90df3f3fe9f23f3f97763f84aa42
EUC-JP 橈??梧?=???腰ч?節??鶯??要?━B 11011100111101100011111100111111101110001110100000111111101000011110000100111111001111110011111110111001111110001010011111101001001111111100000011100001001111110011111111110010111101000011111100111111110011011101011100111111101010001010110001000010 dcf63f3fb8e83fa1e13f3f3fb9f8a7e93fc0e13f3ff2f43f3fcdd73fa8ac42
UTF-8 橈롳슴梧잌=咽뉔찕腰ч쐩節욤옝鶯썹럤要뺡━B 111001101010100110001000111010111010000110110011111011001000101010110100111001101010001010100111111011001001111010001100111011111011110010011101111011111010011010011110111010111000100110010100111011001011000010010101111010001000010110110000110100011000011111101100100100001010100111100111101011111000000011101100100110101010010011101100100110001001110111101001101101101010111111101100100011011011100111101011100111111010010011101000101001101000000111101011101110101010000111100010100101001000000101000010 e6a988eba1b3ec8ab4e6a2a7ec9e8cefbc9defa69eeb8994ecb095e885b0d187ec90a9e7af80ec9aa4ec989de9b6afec8db9eb9fa4e8a681ebbaa1e2948142
UHC 橈롳슴梧잌=咽뉔찕腰ч쐩節욤옝鶯썹럤要뺡━B 11101000111110101000111011101111101111011011111111100111111111001001111111100101101000111011110111100110111011001000011111101001101010011001010111101001101001101010110011101001100111001000111011101111101111011011111111101000100111101001111111100101101000111011110111100111100011101000011111101001101010011001010111101001101001101010110001000010 e8fa8eefbdbfe7fc9fe5a3bde6ec87e9a995e9a6ace99c8eefbdbfe89e9fe5a3bde78e87e9a995e9a6ac42
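The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the helper name `to_bit_strings`, the Python codec names chosen for each charset (`cp932` for SJIS-WIN, `cp949` for UHC), and the use of `errors="replace"` to substitute `?` (0x3f) for unencodable characters are all assumptions made for illustration.

```python
# Sketch: encode text in several charsets and show the bytes as
# binary and hexadecimal strings. All names here are illustrative.

def to_bit_strings(text, charset):
    """Return (binary_string, hex_string) for text encoded in charset.

    Characters the charset cannot represent are replaced with '?'
    (byte 0x3f), which is an assumption about the desired behavior.
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("B", codec)
    print(label, "B", binary, hexadecimal)
```

For the ASCII letter "B", every charset listed yields the single byte 0x42 (binary 01000010), which matches the final byte of each row above.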

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.