To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?ぁ?い?懈い?杭?ぁ?い?懈い?恒^ 0011111110000010100111110011111110000010101000100011111110011100111001101000001010100010001111111000110101011001001111111000001010011111001111111000001010100010001111111001110011100110100000101010001000111111100011010101000001011110 3f829f3f82a23f9ce682a23f8d593f829f3f82a23f9ce682a23f8d505e
EUC-JP ?ぁ?い?懈い?杭?ぁ?い?懈い?恒^ 0011111110100100101000010011111110100100101001000011111111011000111010001010010010100100001111111011100110111010001111111010010010100001001111111010010010100100001111111101100011101000101001001010010000111111101110011011000101011110 3fa4a13fa4a43fd8e8a4a43fb9ba3fa4a13fa4a43fd8e8a4a43fb9b15e
UTF-8 룵ぁ캀い룫懈い룫杭룵ぁ캀い룫懈い룫恒^ 11101011101000111011010111100011100000011000000111101100101110101000000011100011100000011000010011101011101000111010101111100110100001111000100011100011100000011000010011101011101000111010101111100110100111011010110111101011101000111011010111100011100000011000000111101100101110101000000011100011100000011000010011101011101000111010101111100110100001111000100011100011100000011000010011101011101000111010101111100110100000011001001001011110 eba3b5e38181ecba80e38184eba3abe68788e38184eba3abe69dadeba3b5e38181ecba80e38184eba3abe68788e38184eba3abe681925e
UHC 룵ぁ캀い룫懈い룫杭룵ぁ캀い룫懈い룫恒^ 10001111101010101010101010100001101011111000111110101010101001001000111110100010111110101010101110101010101001001000111110100010111110011111100010001111101010101010101010100001101011111000111110101010101001001000111110100010111110101010101110101010101001001000111110100010111110011111011001011110 8faaaaa1af8faaa48fa2faabaaa48fa2f9f88faaaaa1af8faaa48fa2faabaaa48fa2f9f65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)