To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 汚?????鶯??諺??鸚ф?映у?節→?^ 10001001100110000011111100111111001111110011111100111111111010011111001000111111001111111000110010111111001111110011111111101010010111111000010010000110001111111000100101100110100001001000010100111111100100001101111110000001101010000011111101011110 89983f3f3f3f3fe9f23f3f8cbf3f3fea5f84863f896684853f90df81a83f5e
EUC-JP 汚?????鶯??諺??鸚ф?映у?節→?^ 10110001111110000011111100111111001111110011111100111111111100101111010000111111001111111011100011000001001111110011111111110011110000001010011111100110001111111011000111000111101001111110010100111111110000001110000110100010101010100011111101011110 b1f83f3f3f3f3ff2f43f3fb8c13f3ff3c0a7e63fb1c7a7e53fc0e1a2aa3f5e
UTF-8 汚뗥ㄼ呂양괵鶯뚳숯諺듣윀鸚ф씭映у즺節→럦^ 1110011010110001100110101110101110010111101001011110001110000100101111001110111110100110100000001110110010010110100100011110101010110100101101011110100110110110101011111110101110011010101100111110110010001000101011111110100010101011101110101110101110010011101000111110110010011100100000001110100110111000100110101101000110000100111011001001010010101101111001101001100010100000110100011000001111101100101001101011101011100111101011111000000011100010100001101001001011101011100111111010011001011110 e6b19aeb97a5e384bcefa680ec9691eab4b5e9b6afeb9ab3ec88afe8abbaeb93a3ec9c80e9b89ad184ec94ade698a0d183eca6bae7af80e28692eb9fa65e
UHC 汚뗥ㄼ呂양괵鶯뚳숯諺듣윀鸚ф씭映у즺節→럦^ 11100111111111011000101111100101101001001010110011100101111110111011111011100111101100011010110011100101101000111000110011101111101111011010000111100101111011001011010111101000100111111000101111100101101001001010110011100110100111011011111011100111101100011010110011100101101000111000110011101111101111011010000111100110100011101000100101011110 e7fd8be5a4ace5fbbee7b1ace5a38cefbda1e5ecb5e89f8be5a4ace69dbee7b1ace5a38cefbda1e68e895e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)