To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 闡「蝨ーR闡「蝨ー^[闡「蝨ーR闡「蝨ー^[^ 11101000100100011010001011100101100111001011000001010010111010001001000110100010111001011001110010110000010111100101101111101000100100011010001011100101100111001011000001010010111010001001000110100010111001011001110010110000010111100101101101011110 e891a2e59cb052e891a2e59cb05e5be891a2e59cb052e891a2e59cb05e5b5e
EUC-JP 闡「蝨ーR闡「蝨ー^[闡「蝨ーR闡「蝨ー^[^ 111011111111000110001110101000101110100111111100100011101011000001010010111011111111000110001110101000101110100111111100100011101011000001011110010110111110111111110001100011101010001011101001111111001000111010110000010100101110111111110001100011101010001011101001111111001000111010110000010111100101101101011110 eff18ea2e9fc8eb052eff18ea2e9fc8eb05e5beff18ea2e9fc8eb052eff18ea2e9fc8eb05e5b5e
UTF-8 闡「蝨ーR闡「蝨ー^[闡「蝨ーR闡「蝨ー^[^ 11101001100101111010000111101111101111011010001011101000100111011010100011101111101111011011000001010010111010011001011110100001111011111011110110100010111010001001110110101000111011111011110110110000010111100101101111101001100101111010000111101111101111011010001011101000100111011010100011101111101111011011000001010010111010011001011110100001111011111011110110100010111010001001110110101000111011111011110110110000010111100101101101011110 e997a1efbda2e89da8efbdb052e997a1efbda2e89da8efbdb05e5be997a1efbda2e89da8efbdb052e997a1efbda2e89da8efbdb05e5b5e
UHC 闡?蝨?R闡?蝨?^[闡?蝨?R闡?蝨?^[^ 11110100110001010011111111100011101001000011111101010010111101001100010100111111111000111010010000111111010111100101101111110100110001010011111111100011101001000011111101010010111101001100010100111111111000111010010000111111010111100101101101011110 f4c53fe3a43f52f4c53fe3a43f5e5bf4c53fe3a43f52f4c53fe3a43f5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)