To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??裕?┷臾???γ???┷ 111000011001111100111111001111111001011101010100001111111000010010111000111001000110101100111111001111110011111110000011110000010011111100111111001111111000010010111000 e19f3f3f97543f84b8e46b3f3f3f83c13f3f3f84b8
EUC-JP 癲??裕?┷臾???γ???┷ 111000101010000100111111001111111100110110110101001111111010100010111010111001111100110000111111001111110011111110100110110000110011111100111111001111111010100010111010 e2a13f3fcdb53fa8bae7cc3f3f3fa6c33f3f3fa8ba
UTF-8 癲븍쵉裕낉┷臾뗭떼筽γ깷痢믭┷ 1110011110011001101100101110101110111000100011011110110010110101100010011110100010100011100101011110101110000010100010011110001010010100101101111110100010000111101111101110101110010111101011011110101110010110101111001110011110101101101111011100111010110011111010101011100110110111111011111010011110100101111010111010111110101101111000101001010010110111 e799b2ebb88decb589e8a395eb8289e294b7e887beeb97adeb96bce7adbdceb3eab9b7efa7a5ebafade294b7
UHC 癲븍쵉裕낉┷臾뗭떼筽γ깷痢믭┷ 111011111010011010111010111010111010110010001011111010111010111010000101111011111010011010111010111010111010110010001011111011001011011010111100111010001010010010100101111000111000001110100101111011001011100010010010111011111010011010111010 efa6baebac8bebae85efa6baebac8becb6bce8a4a5e383a5ecb892efa6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)