To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 役??鎰??乙??N}役??鎰??乙??N{^ 1001011011110000001111110011111111101000010011000011111100111111100010011011001100111111001111110100111001111101100101101111000000111111001111111110100001001100001111110011111110001001101100110011111100111111010011100111101101011110 96f03f3fe84c3f3f89b33f3f4e7d96f03f3fe84c3f3f89b33f3f4e7b5e
EUC-JP 役??鎰??乙??N}役??鎰??乙??N{^ 1100110011110010001111110011111111101111101011010011111100111111101100101011010100111111001111110100111001111101110011001111001000111111001111111110111110101101001111110011111110110010101101010011111100111111010011100111101101011110 ccf23f3fefad3f3fb2b53f3f4e7dccf23f3fefad3f3fb2b53f3f4e7b5e
UTF-8 役대냵鎰곫찄乙쇱뒪N}役대냵鎰곫찄乙쇱뒪N{^ 1110010110111101101110011110101110001100100000001110101110000011101101011110100110001110101100001110101010110011101010111110110010110000100001001110010010111001100110011110110010000111101100011110101110010010101010100100111001111101111001011011110110111001111010111000110010000000111010111000001110110101111010011000111010110000111010101011001110101011111011001011000010000100111001001011100110011001111011001000011110110001111010111001001010101010010011100111101101011110 e5bdb9eb8c80eb83b5e98eb0eab3abecb084e4b999ec87b1eb92aa4e7de5bdb9eb8c80eb83b5e98eb0eab3abecb084e4b999ec87b1eb92aa4e7b5e
UHC 役대냵鎰곫찄乙쇱뒪N}役대냵鎰곫찄乙쇱뒪N{^ 1110011010110101101101001110101110000110100001011110110011110000100000011110011010101001100010001110101111100000101111001110110010001010101001000100111001111101111001101011010110110100111010111000011010000101111011001111000010000001111001101010100110001000111010111110000010111100111011001000101010100100010011100111101101011110 e6b5b4eb8685ecf081e6a988ebe0bcec8aa44e7de6b5b4eb8685ecf081e6a988ebe0bcec8aa44e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
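
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The function name `to_bit_strings` and the `CODECS` mapping are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-Win to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def to_bit_strings(text, codec):
    """Encode text and return (binary string, hexadecimal string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "N{^"  # the ASCII tail of the input shown in the table
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Because all five charsets are ASCII-compatible, the sample `N{^` encodes to the same bytes in every row; differences only appear for non-ASCII characters.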