To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 塋よ???オ兀??[塋よ???オ兀??[^ 1001101011001000100000101110011000111111001111110011111110000011010010011001100101011001001111110011111101011011100110101100100010000010111001100011111100111111001111111000001101001001100110010101100100111111001111110101101101011110 9ac882e63f3f3f834999593f3f5b9ac882e63f3f3f834999593f3f5b5e
EUC-JP 塋よ???オ兀??[塋よ???オ兀??[^ 1101010011001010101001001110100000111111001111110011111110100101101010101101000110111010001111110011111101011011110101001100101010100100111010000011111100111111001111111010010110101010110100011011101000111111001111110101101101011110 d4caa4e83f3f3fa5aad1ba3f3f5bd4caa4e83f3f3fa5aad1ba3f3f5b5e
UTF-8 塋よ큹嶪뤹オ兀덃뿈[塋よ큹嶪뤹オ兀덃뿈[^ 111001011010000110001011111000111000001010001000111011011000000110111001111001011011011010101010111010111010010010111001111000111000001010101010111001011000010110000000111010111000110110000011111010111011111110001000010110111110010110100001100010111110001110000010100010001110110110000001101110011110010110110110101010101110101110100100101110011110001110000010101010101110010110000101100000001110101110001101100000111110101110111111100010000101101101011110 e5a18be38288ed81b9e5b6aaeba4b9e382aae58580eb8d83ebbf885be5a18be38288ed81b9e5b6aaeba4b9e382aae58580eb8d83ebbf885b5e
UHC 塋よ큹嶪뤹オ兀덃뿈[塋よ큹嶪뤹オ兀덃뿈[^ 111001111010101110101010111010001011010010001000111001011111010110001111111001111010101110101010111010001011010010001000111001101001011110001111010110111110011110101011101010101110100010110100100010001110010111110101100011111110011110101011101010101110100010110100100010001110011010010111100011110101101101011110 e7abaae8b488e5f58fe7abaae8b488e6978f5be7abaae8b488e5f58fe7abaae8b488e6978f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)