What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???韋??蟻??瑤??松??艶b??? 0011111100111111001111111110100011101000001111110011111110001011011000010011111100111111111010101010001000111111001111111000111110111100001111110011111110001001100100001000001010000010001111110011111100111111 3f3f3fe8e83f3f8b613f3feaa23f3f8fbc3f3f899082823f3f3f
EUC-JP ???韋??蟻??瑤??松??艶b?嫄? 00111111001111110011111111110000111010100011111100111111101101011100001000111111001111111111010010100100001111110011111110111110101111100011111100111111101100011111000010100011111000100011111110001111101110101010000100111111 3f3f3ff0ea3f3fb5c23f3ff4a43f3fbebe3f3fb1f0a3e23f8fbaa13f
UTF-8 僚녹뼔韋귛푻蟻욎춻瑤녠쾯松쎌춻艶b뮧嫄턆 111011111010011010111011111010111000010110111001111010111011110010010100111010011001111110001011111010101011011110011011111011011001000110111011111010001001111110111011111011001001101010001110111011001011011010111011111001111001000110100100111010111000010110100000111011001011111010101111111001101001110110111110111011001000111010001100111011001011011010111011111010001000100110110110111011111011110110000010111010111010111010100111111001011010101110000100111011011000010010000110 efa6bbeb85b9ebbc94e99f8beab79bed91bbe89fbbec9a8eecb6bbe791a4eb85a0ecbeafe69dbeec8e8cecb6bbe889b6efbd82ebaea7e5ab84ed8486
UHC 僚녹뼔韋귛푻蟻욎춻瑤녠쾯松쎌춻艶b뮧嫄턆 11101000111010001011001111101100100101101001110011101010110111111000001011100101101111101000011111101011111111001001111011101100101011011001011111101000111111011011001111101010101100101000011011100001111001101011110111101100101011011001011111100110111111011010001111100010100100101011001011101010101100011011011001000010 e8e8b3ec969ceadf82e5be87ebfc9eecad97e8fdb3eab286e1e6bdecad97e6fda3e292b2eab1b642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
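
The conversion shown in the table can be approximated in Python. This is only a minimal sketch, not the page's actual implementation: the function name `to_bit_strings` is hypothetical, and the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        encoded = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits and concatenate them.
        binary = "".join(f"{byte:08b}" for byte in encoded)
        result[label] = (binary, encoded.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("松").items():
        print(label, binary, hexadecimal)
```

For the single character 松, this sketch yields `e69dbe` for UTF-8, `8fbc` for SJIS-WIN, and `bebe` for EUC-JP, the same byte sequences that appear next to 松 in the table, and `3f` for ISO-8859-1, which has no code point for it.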