To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 癲앷풝援θぜ松쎌コ 1110011110011001101100101110110010010101101101111110110110010010100111011110011010001111101101001100111010111000111000111000000110011100111001101001110110111110111011001000111010001100111000111000001010110011 e799b2ec95b7ed929de68fb4ceb8e3819ce69dbeec8e8ce382b3
SJIS-WIN ???????????´?????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110000001010011000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f814c3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ç??ì??í??æ?´Î¸ã??æ??ì??ã?? 1000111110101011101011100011111100111111100011111010101111000000001111110011111110001111101010111011111100111111001111111000111110101001110000010011111110100001101011011000111110101010110000101000111110100010101100011000111110101011101010100011111100111111100011111010100111000001001111110011111110001111101010111100000000111111001111111000111110101011101010100011111100111111 8fabae3f3f8fabc03f3f8fabbf3f3f8fa9c13fa1ad8faac28fa2b18fabaa3f3f8fa9c13f3f8fabc03f3f8fabaa3f3f
UTF-8 癲앷풝援θぜ松쎌コ 11000011101001111100001010011001110000101011001011000011101011001100001010010101110000101011011111000011101011011100001010010010110000101001110111000011101001101100001010001111110000101011010011000011100011101100001010111000110000111010001111000010100000011100001010011100110000111010011011000010100111011100001010111110110000111010110011000010100011101100001010001100110000111010001111000010100000101100001010110011 c3a7c299c2b2c3acc295c2b7c3adc292c29dc3a6c28fc2b4c38ec2b8c3a3c281c29cc3a6c29dc2bec3acc28ec28cc3a3c282c2b3
UHC ??²??·???æ?´?¸???æ?¾?????³ 00111111001111111010100111110111001111110011111110100001101001000011111100111111001111111010100110100001001111111010001010100101001111111010001010101100001111110011111100111111101010011010000100111111101010001111101000111111001111110011111100111111001111111010100111111000 3f3fa9f73f3fa1a43f3f3fa9a13fa2a53fa2ac3f3f3fa9a13fa8fa3f3f3f3f3fa9f8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)