What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | ??云???葬謗 | 0011111100111111100010010101110100111111001111110011111110010001100100101110011010001110 | 3f3f895d3f3f3f9192e68e |
| EUC-JP | ??云???葬謗 | 0011111100111111101100011011111000111111001111110011111111000001111100101110101111101110 | 3f3fb1be3f3f3fc1f2ebee |
| UTF-8 | 梨렗云닺렞렗葬謗 | 111011111010011110100010111010111010000010010111111001001011101010010001111010111000101110111010111010111010000010011110111010111010000010010111111010001001000110101100111010001010110010010111 | efa7a2eba097e4ba91eb8bbaeba09eeba097e891ace8ac97 |
| UHC | 梨렗云닺렞렗葬謗 | 11101100101100011000111010101100111010011111011010110100111010001000111010101111100011101010110011101101111101111101101110111111 | ecb18eace9f6b4e88eaf8eacedf7dbbf |
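
A conversion like the one in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names is an assumption (in particular `cp949` for UHC), as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the charset labels above to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become b"?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in that charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte is rendered as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Build one result row per charset, keyed by the charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("云").items():
        print(label, shown, bits, hexstr)
```

For the single character 云, the sketch yields `e4ba91` in UTF-8, `895d` in CP932, `b1be` in EUC-JP, and `3f` in ISO-8859-1, in line with the corresponding bytes in the table.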

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).