To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????而?????姨〓????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001110101001110011111100111111001111110011111100111111100110110100100010000001101011000011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f8ea73f3f3f3f3f9b4881ac3f3f3f3f42
EUC-JP ????????????而?????姨〓????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110111100101010010011111100111111001111110011111100111111110101011010100110100010101011100011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3fbca93f3f3f3f3fd5a9a2ae3f3f3f3f42
UTF-8 溜삳젧溜븍젷溜삭쵗溜볥졎而대젘溜븐뀛姨〓젻溜븍젫B 11101111101001111000101111101100100000101011001111101100101000001010011111101111101001111000101111101011101110001000110111101100101000001011011111101111101001111000101111101100100000101010110111101100101101011001011111101111101001111000101111101011101100111010010111101100101000011000111011101000100000001000110011101011100011001000000011101100101000001001100011101111101001111000101111101011101110001001000011101011100000001001101111100101101001111010100011100011100000001001001111101100101000001011101111101111101001111000101111101011101110001000110111101100101000001010101101000010 efa78bec82b3eca0a7efa78bebb88deca0b7efa78bec82adecb597efa78bebb3a5eca18ee8808ceb8c80eca098efa78bebb890eb809be5a7a8e38093eca0bbefa78bebb88deca0ab42
UHC 溜삳젧溜븍젷溜삭쵗溜볥졎而대젘溜븐뀛姨〓젻溜븍젫B 11101010111111101011101111101011101000001001111111101010111111101011101011101011101000001010101111101010111111101011101111101000101011001001100111101010111111101001001111101011101000001011101111101100101110111011010011101011101000001001010011101010111111101011101011101100100001011001010011101100101010011010000111101011101000001010111011101010111111101011101011101011101000001010001101000010 eafebbeba09feafebaeba0abeafebbe8ac99eafe93eba0bbecbbb4eba094eafebaec8594eca9a1eba0aeeafebaeba0a342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)