What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 咀?魚??????刻咀?魚???????? 1001100111110000001111111000101110011011001111110011111100111111001111110011111100111111100011011000111110011001111100000011111110001011100110110011111100111111001111110011111100111111001111110011111100111111 99f03f8b9b3f3f3f3f3f3f8d8f99f03f8b9b3f3f3f3f3f3f3f3f
EUC-JP 咀?魚??????刻咀?魚???????? 1101001011110010001111111011010111111011001111110011111100111111001111110011111100111111101110011110111111010010111100100011111110110101111110110011111100111111001111110011111100111111001111110011111100111111 d2f23fb5fb3f3f3f3f3f3fb9efd2f23fb5fb3f3f3f3f3f3f3f3f
UTF-8 咀렞魚판뀜렒띤렗뀄刻咀렞魚판뀜렒띤렗곡렰굻 111001011001001010000000111010111010000010011110111010011010110110011010111011011000110010010000111010111000000010011100111010111010000010010010111010111001110110100100111010111010000010010111111010111000000010000100111001011000100010111011111001011001001010000000111010111010000010011110111010011010110110011010111011011000110010010000111010111000000010011100111010111010000010010010111010111001110110100100111010111010000010010111111010101011001110100001111010111010000010110000111010101011010110111011 e59280eba09ee9ad9aed8c90eb809ceba092eb9da4eba097eb8084e588bbe59280eba09ee9ad9aed8c90eb809ceba092eb9da4eba097eab3a1eba0b0eab5bb
UHC 咀렞魚판뀜렒띤렗뀄刻咀렞魚판뀜렒띤렗곡렰굻 111011101011101010001110101011111110010111100000110001101100011110110010111100011000111010100111101101101110110110001110101011001011001011101101110010101011111011101110101110101000111010101111111001011110000011000110110001111011001011110001100011101010011110110110111011011000111010101100101100001110111010001110101111011011000110111111 eeba8eafe5e0c6c7b2f18ea7b6ed8eacb2edcabeeeba8eafe5e0c6c7b2f18ea7b6ed8eacb0ee8ebdb1bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
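
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names is an assumption (in particular, UHC is taken to be Python's cp949 codec), and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which appears to be what the table does.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_table("魚").items():
        print(name, bits, hexstr)
```

For the single character 魚, this yields e9ad9a for UTF-8, 8b9b for SJIS-WIN, b5fb for EUC-JP, and 3f for ISO-8859-1, which agrees with the corresponding byte sequences in the table above.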