To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????幽??嵬??愉???????? 001111110011111100111111001111110011111100111111100101110100100000111111001111111001101111001010001111110011111110010110111110010011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f97483f3f9bca3f3f96f93f3f3f3f3f3f3f3f
EUC-JP ???靷??幽??嵬??愉???????? 0011111100111111001111111000111111100111101111010011111100111111110011011010100100111111001111111101011011001100001111110011111111001100111110110011111100111111001111110011111100111111001111110011111100111111 3f3f3f8fe7bd3f3fcda93f3fd6cc3f3fccfb3f3f3f3f3f3f3f3f
UTF-8 嶺뚮뿫靷숃굜幽뚯춷嵬됯퀣愉얕뮲紐꾨튂梨덄춯 111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111011001000100010000011111010101011010110011100111001011011100110111101111010111001101010101111111011001011011010110111111001011011010110101100111010111001000010101111111011011000000010100011111001101000010010001001111011001001011010010101111010111010111010110010111011111010011110001111111010101011111010101000111011011000101010000010111011111010011110100010111010111000110110000100111011001011011010101111 efa6abeb9aaeebbfabe99db7ec8883eab59ce5b9bdeb9aafecb6b7e5b5aceb90afed80a3e68489ec9695ebaeb2efa78feabea8ed8a82efa7a2eb8d84ecb6af
UHC 嶺뚮뿫靷숃굜幽뚯춷嵬됯퀣愉얕뮲紐꾨튂梨덄춯 111001111010110110001100111010111001011110101011111011001110011010011001111010001000001010000100111010101110101110001100111011001010110110010011111010001110001110001001111010101011001110010111111010101111000010111110111010001001001010111011111010111010101010000100111010111011100110011000111011001011000110001000111001111010110110001100 e7ad8ceb97abece699e88284eaeb8cecad93e8e389eab397eaf0bee892bbebaa84ebb998ecb188e7ad8c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)