What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??ぁ?丹悠ァ?谷??ぁ?丹悠ァ?梏^ 0011111100111111100000101001111100111111100100100100111110010111010010011000001101000000001111111001001001001010001111110011111110000010100111110011111110010010010011111001011101001001100000110100000000111111100111101000011101011110 3f3f829f3f924f974983403f924a3f3f829f3f924f974983403f9e875e
EUC-JP ??ぁ?丹悠ァ?谷??ぁ?丹悠ァ?梏^ 0011111100111111101001001010000100111111110000111011000011001101101010101010010110100001001111111100001110101011001111110011111110100100101000010011111111000011101100001100110110101010101001011010000100111111110110111110011101011110 3f3fa4a13fc3b0cdaaa5a13fc3ab3f3fa4a13fc3b0cdaaa5a13fdbe75e
UTF-8 룶쨵ぁ룶丹悠ァ룴谷룶쨵ぁ룶丹悠ァ룴梏^ 11101011101000111011011011101100101010001011010111100011100000011000000111101011101000111011011011100100101110001011100111100110100000101010000011100011100000101010000111101011101000111011010011101000101100001011011111101011101000111011011011101100101010001011010111100011100000011000000111101011101000111011011011100100101110001011100111100110100000101010000011100011100000101010000111101011101000111011010011100110101000101000111101011110 eba3b6eca8b5e38181eba3b6e4b8b9e682a0e382a1eba3b4e8b0b7eba3b6eca8b5e38181eba3b6e4b8b9e682a0e382a1eba3b4e6a28f5e
UHC 룶쨵ぁ룶丹悠ァ룴谷룶쨵ぁ룶丹悠ァ룴梏^ 10001111101010111010010010001111101010101010000110001111101010111101001110100001111010101110110110101011101000011000111110101001110011011101101110001111101010111010010010001111101010101010000110001111101010111101001110100001111010101110110110101011101000011000111110101001110011011101100101011110 8faba48faaa18fabd3a1eaedaba18fa9cddb8faba48faaa18fabd3a1eaedaba18fa9cdd95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
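
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the page's actual implementation: the helper name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the choice of Python codec for each label (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `?` entries seen in the table.

```python
def encode_to_bits(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Characters the charset cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


# Assumed mapping from the labels used on the page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_to_bits("ぁ^", codec)
    print(label, binary, hexadecimal)

# Example: encode_to_bits("ぁ", "utf-8") returns
# ("111000111000000110000001", "e38181")
```

The same character yields a different byte sequence in each charset: for example, `ぁ` is `829f` in CP932, `a4a1` in EUC-JP, and `e38181` in UTF-8, in agreement with the rows above.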