To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??節??蘖??與?????鈺??節 11111011110001000011111100111111100100001101111100111111001111111001111101010000001111110011111111100100011011110011111100111111001111110011111100111111111110111100010000111111001111111001000011011111 fbc43f3f90df3f3f9f503f3fe46f3f3f3f3f3ffbc43f3f90df
EUC-JP 鈺??節??蘖??與??旿??鈺??節 1000111111100011110101010011111100111111110000001110000100111111001111111101110110110001001111110011111111100111110100000011111100111111100011111100000111110100001111110011111110001111111000111101010100111111001111111100000011100001 8fe3d53f3fc0e13f3fddb13f3fe7d03f3f8fc1f43f3f8fe3d53f3fc0e1
UTF-8 鈺싮죱節닷윜蘖쀨왊與딀릹旿딉슁鈺싮죱節 111010011000100010111010111011001000101110101110111011001010001110110001111001111010111110000000111010111000101110110111111011001001110010011100111010001001100010010110111011001000000010101000111011001001100110001010111010001000100010000111111010111001010010000000111010111010011010111001111001101001011110111111111010111001010010001001111011001000101010000001111010011000100010111010111011001000101110101110111011001010001110110001111001111010111110000000 e988baec8baeeca3b1e7af80eb8bb7ec9c9ce89896ec80a8ec998ae88887eb9480eba6b9e697bfeb9489ec8a81e988baec8baeeca3b1e7af80
UHC 鈺싮죱節닷윜蘖쀨왊與딀릹旿딉슁鈺싮죱節 1110100010101101100110101110100110100001100011001110111110111101101101001110010110011111100111111110010111101110100101111110100010011110101110111110011010101000100010101110011010010000100101111110011111111010100010101110111110111101101100111110100010101101100110101110100110100001100011001110111110111101 e8ad9ae9a18cefbdb4e59f9fe5ee97e89ebbe6a88ae69097e7fa8aefbdb3e8ad9ae9a18cefbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)