To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??柔??轅??????辱?…猷??喩?? 1110101001011111001111110011111110001111010111110011111100111111111001110111011000111111001111110011111100111111001111110011111110010000010010100011111110000001011000111001011101010001001111110011111110011010011001110011111100111111 ea5f3f3f8f5f3f3fe7763f3f3f3f3f3f904a3f816397513f3f9a673f3f
EUC-JP 鸚??柔??轅??????辱?…猷??喩?? 1111001111000000001111110011111110111101110000000011111100111111111011011101011100111111001111110011111100111111001111110011111110111111101010110011111110100001110001001100110110110010001111110011111111010011110010000011111100111111 f3c03f3fbdc03f3fedd73f3f3f3f3f3fbfab3fa1c4cdb23f3fd3c83f3f
UTF-8 鸚쒖룆柔꾢젆轅대젧捻믍됰늾辱됰…猷싨략喩쀯폁 111010011011100010011010111011001001001010010110111010111010001110000110111001101001111110010100111010101011111010100010111011001010000010000110111010001011110110000101111010111000110010000000111011001010000010100111111011111010011010100100111010111010111110001101111010111001000010110000111010111000101010111110111010001011111010110001111010111001000010110000111000101000000010100110111001111000110010110111111011001000101110101000111010111001111010110101111001011001011010101001111011001000000010101111111011011000111110000001 e9b89aec9296eba386e69f94eabea2eca086e8bd85eb8c80eca0a7efa6a4ebaf8deb90b0eb8abee8beb1eb90b0e280a6e78cb7ec8ba8eb9eb5e596a9ec80afed8f81
UHC 鸚쒖룆柔꾢젆轅대젧捻믍됰늾辱됰…猷싨략喩쀯폁 1110010110100100100111001110110010001111100001011110101011110101100001001110010110100000100010011110101010111111101101001110101110100000100111111110011011110111100100101101000110001001111010111000100010000111111010011011010010001001111010111010000110100110111010111010001110011010111001101011011110101011111010101110011110010111111011111011110010010000 e5a49cec8f85eaf584e5a089eabfb4eba09fe6f792d189eb8887e9b489eba1a6eba39ae6b7abeae797efbc90
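The kind of conversion tabulated above can be sketched in Python. This is a minimal illustration, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) and the helper names `encode_row` and `convert` are assumptions made for this example. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the `3f` bytes in the ISO-8859-1 row.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption made for illustration.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Hiragana "a" (U+3042) as a sample input.
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

For the sample input, the same character comes out as `82a0` in SJIS-WIN, `a4a2` in EUC-JP, and `e38182` in UTF-8, while ISO-8859-1 cannot represent it and yields `3f`.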

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).